Preface


I am grateful to the readers of the first edition who have made suggestions 
for improvement. Apart from some minor corrections, the principal changes 
are as follows. 

The equation connecting the curvatures of four mutually tangent circles, 
now known as the Descartes Circle Theorem (p. 12), is proved along the lines 
suggested by Mr. Beecroft on pp. 91-96 of “The Lady’s and Gentleman’s 
Diary for the year of our Lord 1842, being the second after Bissextile, de- 
signed principally for the amusement and instruction of Students in Mathe- 
matics: comprising many useful and entertaining particulars, interesting to 
all persons engaged in that delightful pursuit.” 

For similarity in the plane, a new treatment (pp. 73-76) was suggested by 
A. L. Steger when he was a sophomore at the University of Toronto. For 
similarity in space, a different treatment (p. 103) was suggested by Professor 
Maria Wonenburger. A new exercise on p. 90 introduces the useful concept
of inversive distance. Another has been inserted on p. 127 to exhibit R.
Krasnodebski’s drawings of symmetrical loxodromes.

Pages 203-208 have been revised so as to clarify the treatment of affinities 
(which preserve collinearity) and equiaffinities (which preserve area). The 
new material includes some challenging exercises. For the discovery of finite 
geometries (p. 237), credit has been given to von Staudt, who anticipated 
Fano by 36 years. 

Page 395 records the completion, in 1968, by G. Ringel and J. W. T. 
Youngs, of a project begun by Heawood in 1890. The result is that we now 
know, for every kind of surface except the sphere (or plane), the minimal 
number of colors that will suffice for coloring every map on the surface. 

Answers are now given for practically all the exercises; a separate booklet 
is no longer needed. One of the prettiest answers (p. 453) was kindly
supplied by Professor P. Szász of Budapest.


H. S. M. Coxeter


Toronto, Canada 
January, 1969 


Preface to the 
first edition 


For the last thirty or forty years, most Americans have somehow lost in- 
terest in geometry. The present book constitutes an attempt to revitalize 
this sadly neglected subject. 

The four parts correspond roughly to the four years of college work. 
However, most of Part II can be read before Part I, and most of Part IV
before Part III. The first eleven chapters (that is, Parts I and II) will pro-
vide a course for students who have some knowledge of Euclid and ele- 
mentary analytic geometry but have not yet made up their minds to spe- 
cialize in mathematics, or for enterprising high school teachers who wish to 
see what is happening just beyond their usual curriculum. Part III deals 
with the foundations of geometry, including projective geometry and hyper- 
bolic non-Euclidean geometry. Part IV introduces differential geometry, 
combinatorial topology, and four-dimensional Euclidean geometry. 

In spite of the large number of cross references, each of the twenty-two 
chapters is reasonably self-contained; many of them can be omitted on first 
reading without spoiling one’s enjoyment of the rest. For instance, Chapters 
1, 3, 6, 8, 13, and 17 would make a good short course. There are relevant 
exercises at the end of almost every section; the hardest of them are pro- 
vided with hints for their solution. (Answers to some of the exercises are 
given at the end of the book. Answers to many of the remaining exercises 
are provided in a separate booklet, available from the publisher upon re- 
quest.) The unifying thread that runs through the whole work is the idea 
of a group of transformations or, in a single word, symmetry. 

The customary emphasis on analytic geometry is likely to give students 
the impression that geometry is merely a part of algebra or of analysis. It 
is refreshing to observe that there are some important instances (such as 
the Argand diagram described in Chapter 9) in which geometrical ideas are 
needed as essential tools in the development of these other branches of 
mathematics. The scope of geometry was spectacularly broadened by Klein 
in his Erlanger Programm (Erlangen program) of 1872, which stressed the 
fact that, besides plane and solid Euclidean geometry, there are many other 
geometries equally worthy of attention. For instance, many of Euclid’s own 
propositions belong to the wider field of affine geometry, which is valid not
only in ordinary space but also in Minkowski’s space-time, so successfully
exploited by Einstein in his special theory of relativity.

Geometry is useful not only in algebra, analysis, and cosmology, but also 
in kinematics and crystallography (where it is associated with the theory of 
groups), in statistics (where finite geometries help in the design of experi- 
ments), and even in botany. The subject of topology (Chapter 21) has been 
developed so widely that it now stands on its own feet instead of being re- 
garded as part of geometry; but it fits into the Erlangen program, and its 
early stages have the added appeal of a famous unsolved problem: that of 
deciding whether every possible map can be colored with four colors. 

The material grew out of courses of lectures delivered at summer insti- 
tutes for school teachers and others at Stillwater, Oklahoma; Lunenburg, 
Nova Scotia; Ann Arbor, Michigan; Stanford, California; and Fredericton, 
New Brunswick, along with several public lectures given to the Friends of 
Scripta Mathematica in New York City by invitation of the late Professor 
Jekuthiel Ginsburg. The most popular of these separate lectures was the 
one on the golden section and phyllotaxis, which is embodied in Chapter 11. 

Apart from the general emphasis on the idea of transformation and on 
the desirability of spending some time in such unusual environments as af- 
fine space and absolute space, the chief novelties are as follows: a simple 
treatment of the orthocenter (§ 1.6); the use of dominoes to illustrate six of 
the seventeen space groups of two-dimensional crystallography (§ 4.4); a 
construction for the invariant point of a dilative reflection (§ 5.6); a descrip- 
tion of the general circle-preserving transformation (§ 6.7) and of the spiral 
similarity (§ 7.6); an “explanation” of phyllotaxis (§ 11.5); an “ordered” 
treatment of Sylvester’s problem (§ 12.3); an economical system of axioms 
for affine geometry (§ 13.1); an “absolute” treatment of rotation groups 
(§ 15.4); an elementary treatment of the horosphere (§ 16.8) and of the ex- 
treme ternary quadratic form (§ 18.4); the correction of a prevalent error 
concerning the shape of the monkey saddle (§ 19.8); an application of geo- 
desic polar coordinates to the foundations of hyperbolic trigonometry 
(§ 20.6); the classification of regular maps on the sphere, projective plane, 
torus, and Klein bottle (§ 21.3); and the suggestion of a statistical honey- 
comb (§ 22.5). 

I offer sincere thanks to M. W. Al-Dhahir, J. J. Burckhardt, Werner Fen- 
chel, L. M. Kelly, Peter Scherk, and F. A. Sherk for critically reading various 
chapters; also to H. G. Forder, Martin Gardner, and C. J. Scriba for their
help in proofreading, to S. H. Gould, J. E. Littlewood, and J. L. Synge for 
permission to quote certain passages from their published works, and to 
M. C. Escher, I. Kitrosser, and the Royal Society of Canada for permission 
to reproduce the plates. 


H. S. M. Coxeter


Toronto, Canada 
March, 1961 
Mathematics possesses not only truth, but supreme beauty
without appeal to any part of our weaker nature...
such as only the greatest art can show.


BERTRAND RUSSELL (1872 - 1970) 
Triangles


In this chapter we review some of the well-known propositions of ele- 
mentary geometry, stressing the role of symmetry. We refer to Euclid’s 
propositions by his own numbers, which have been used throughout the 
world for more than two thousand years. Since the time of F. Commandino 
(1509-1575), who translated the works of Archimedes, Apollonius, and 
Pappus, many other theorems in the same spirit have been discovered. 
Such results were studied in great detail during the nineteenth century. As 
the present tendency is to abandon them in favor of other branches of 
mathematics, we shall be content to mention a few that seem particularly 
interesting. 


1.1 EUCLID 


Euclid's work will live long after all the text-books of the present day
are superseded and forgotten. It is one of the noblest monuments of
antiquity.


Sir Thomas L. Heath (1861-1940)*


About 300 B.C., Euclid of Alexandria wrote a treatise in thirteen books
called the Elements. Of the author (sometimes regrettably confused with the
earlier philosopher, Euclid of Megara) we know very little. Proclus (410-485
A.D.) said that he “put together the Elements, collecting many of Eudoxus’s
theorems, perfecting many of Theaetetus’s, and also bringing to irrefragable
demonstration the things which were only somewhat loosely
proved by his predecessors. This man lived in the time of the first Ptolemy, 
[who] once asked him if there was in geometry any shorter way than that 
of the Elements, and he answered that there was no royal road to geometry.” 
Heath quotes a story by Stobaeus, to the effect that someone who had be- 
gun to read geometry with Euclid asked him “What shall I get by learning 
these things?” Euclid called his slave and said “Give him a dime, since he 
must make gain out of what he learns.” 

* Heath 1, p. vi. (Such references are collected at the end of the book, pp. 415-417.) 
Of the thirteen books, the first six may be very briefly described as dealing
respectively with triangles, rectangles, circles, polygons, proportion, and
similarity. The next four, on the theory of numbers, include two notable
achievements: IX.20 and X.9, where it is proved that there are infinitely
many prime numbers, and that √2 is irrational [Hardy 2, pp. 32-36].
Book XI is an introduction to solid geometry, XII deals with pyramids, 
cones, and cylinders, and XIII is on the five regular solids. 

According to Proclus, Euclid “set before himself, as the end of the whole
Elements, the construction of the so-called Platonic figures.” This notion
of Euclid’s purpose is supported by the Platonic theory of a mystical
correspondence between the four solids

    cube, tetrahedron, octahedron, icosahedron

and the four “elements”

    earth, fire, air, water

[cf. Coxeter 1, p. 18]. Evidence to the contrary is supplied by the arithmetical
books VII-X, which were obviously included for their intrinsic interest
rather than for any application to solid geometry.


1.2 PRIMITIVE CONCEPTS AND AXIOMS 


“When I use a word,” Humpty-Dumpty said, “it means just what I
choose it to mean—neither more nor less.”

Lewis Carroll (1832-1898)

[Dodgson 2, Chap. 6]


In the logical development of any branch of mathematics, each definition 
of a concept or relation involves other concepts and relations. Therefore 
the only way to avoid a vicious circle is to allow certain primitive concepts 
and relations (usually as few as possible) to remain undefined [Synge 1, pp. 
32-34]. Similarly, the proof of each proposition uses other propositions, 
and therefore certain primitive propositions, called postulates or axioms, 
must remain unproved. Euclid did not specify his primitive concepts and 
relations, but was content to give definitions in terms of ideas that would 
be familiar to everybody. His five Postulates are as follows: 


1.21 A straight line may be drawn from any point to any other point. 
1.22 A finite straight line may be extended continuously in a straight line. 
1.23 A circle may be described with any center and any radius. 

1.24 All right angles are equal to one another. 


1.25 If a straight line meets two other straight lines so as to make the two
interior angles on one side of it together less than two right angles, the other
straight lines, if extended indefinitely, will meet on that side on which the
angles are less than two right angles.*


It is quite natural that, after a lapse of about 2250 years, some details are
now seen to be capable of improvement. (For instance, Euclid I.1 constructs
an equilateral triangle by drawing two circles; but how do we know
that these two circles will intersect?) The marvel is that so much of Euclid’s
work remains perfectly valid. In the modern treatment of his geometry
[see, for instance, Coxeter 3, pp. 161-187], it is usual to recognize the primitive
concept point and the two primitive relations of intermediacy (the
idea that one point may be between two others) and congruence (the idea 
that the distance between two points may be equal to the distance between 
two other points, or that two line segments may have the same length). 
There are also various versions of the axiom of continuity, one of which says 
that every convergent sequence of points has a limit. 

Euclid’s “principle of superposition,” used in proving I.4, raises the question
whether a figure can be moved without changing its internal structure.
This principle is nowadays replaced by a further explicit assumption such 
as the axiom of “the rigidity of a triangle with a tail” (Figure 1.2a): 


1.26 If ABC is a triangle with D on the side BC extended, while D′ is
analogously related to another triangle A′B′C′, and if BC = B′C′, CA = C′A′,
AB = A′B′, BD = B′D′, then AD = A′D′.


Figure 1.2a 


This axiom can be used to extend the notion of congruence from line seg- 
ments to more complicated figures such as angles, so that we can say pre- 
cisely what we mean by the relation 


∠ABC = ∠A′B′C′.


Then we no longer need the questionable principle of superposition in order
to prove Euclid I.4:

If two triangles have two sides equal to two sides respectively, and have the
angles contained by the equal sides equal, they will also have their third sides
equal, and their remaining angles equal respectively; in fact, they will be
congruent triangles.


*In Chapter 15 we shall see how far we can go without using this unpleasantly complicated 
Fifth Postulate.


1.3 PONS ASINORUM


Minos: It is proposed to prove I.5 by taking up the isosceles Triangle,
turning it over, and then laying it down again upon itself.

Euclid: Surely that has too much of the Irish Bull about it, and re- 
minds one a little too vividly of the man who walked down his own 
throat, to deserve a place in a strictly philosophical treatise? 

Minos: I suppose its defenders would say that it is conceived to leave
a trace of itself behind, and that the reversed Triangle is laid down 
upon the trace so left. 


C. L. Dodgson (1832 -1898) 
[Dodgson 3, p. 48] 


I.5. The angles at the base of an isosceles triangle are equal.

The name pons asinorum for this famous theorem probably arose from 
the bridgelike appearance of Euclid’s figure (with the construction lines re- 
quired in his rather complicated proof) and from the notion that anyone un- 
able to cross this bridge must be an ass. Fortunately, a far simpler proof 
was supplied by Pappus of Alexandria about 340 A.D. (Figure 1.3a):


Figure 1.3a 


Let ABC be an isosceles triangle with AB equal to AC. Let us conceive this tri- 
angle as two triangles and argue in this way. Since AB = AC and AC= AB, the two 
sides AB, AC are equal to the two sides AC, AB. Also the angle BAC is equal to the 
angle CAB, for it is the same. Therefore all the corresponding parts (of the triangles 
ABC, ACB) are equal. In particular, 


∠ABC = ∠ACB.


The pedagogical difficulty of comparing the isosceles triangle ABC with 
itself is sometimes avoided by joining the apex A to D, the midpoint of the 
base BC. The median AD may be regarded as a mirror reflecting B into C.
Accordingly, we say that an isosceles triangle is symmetrical by reflection, 
or that it has bilateral symmetry. (Of course, the idealized mirror used in 
geometry has no thickness and is silvered on both sides, so that it not only 
reflects B into C but also reflects C into B.) 
Any figure, however irregular its shape may be, yields a symmetrical figure
when we place it next to a mirror and waive the distinction between
object and image. Such bilateral symmetry is characteristic of the external 
shape of most animals. 

Given any point P on either side of a geometrical mirror, we can con- 
struct its reflected image P’ by drawing the perpendicular from P to the 
mirror and extending this perpendicular line to an equal distance on the 
other side, so that the mirror perpendicularly bisects the line segment PP’. 
Working in the plane (Figure 1.3b) with a line AB for mirror, we draw two
circles with centers A, B and radii AP, BP. The two points of intersection 
of these circles are P and its image P’. 


Figure 1.3b 


We shall find that many geometrical proofs are shortened and made more 
vivid by the use of reflections. But we must remember that this procedure 
is merely a short cut: every such argument could have been avoided by 
means of a circumlocution involving congruent triangles. For instance, the 
above construction is valid because the triangles ABP, ABP’ are congruent. 
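
The reflection just described can be tried out in coordinates. The following is only a sketch under stated assumptions: the function names `reflect` and `dist` and the sample points are introduced here for illustration and do not come from the text. It reflects P in the line AB by way of the foot of the perpendicular, then checks the property on which the two-circle construction rests, namely AP′ = AP and BP′ = BP.

```python
import math


def reflect(p, a, b):
    """Reflect the point p in the line through a and b (points are 2-tuples)."""
    ax, ay = a
    bx, by = b
    px, py = p
    dx, dy = bx - ax, by - ay
    # Parameter of the foot F of the perpendicular from p to the line ab.
    t = ((px - ax) * dx + (py - ay) * dy) / (dx * dx + dy * dy)
    fx, fy = ax + t * dx, ay + t * dy
    # The mirror perpendicularly bisects PP', so P' = 2F - P.
    return (2 * fx - px, 2 * fy - py)


def dist(u, v):
    """Euclidean distance between two points."""
    return math.hypot(u[0] - v[0], u[1] - v[1])


# Hypothetical sample data: mirror AB and a point P off the mirror.
a, b, p = (0.0, 0.0), (4.0, 1.0), (1.0, 3.0)
p_image = reflect(p, a, b)

# P' lies on the circle with center A through P and on the circle
# with center B through P, as in the compass construction.
print(dist(a, p), dist(a, p_image))
print(dist(b, p), dist(b, p_image))
```

The two printed pairs should agree up to rounding, which is the numerical counterpart of the congruence of the triangles ABP and ABP′.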


Pons asinorum has many useful consequences, such as the following five: 


III.3. If a diameter of a circle bisects a chord which does not pass through the center,
it is perpendicular to it; or, if perpendicular to it, it bisects it.

III.20. In a circle the angle at the center is double the angle at the circumference, when
the rays forming the angles meet the circumference in the same two points.

III.21. In a circle, a chord subtends equal angles at any two points on the same one
of the two arcs determined by the chord (e.g., in Figure 1.3c, ∠PQQ′ = ∠PP′Q′).

III.22. The opposite angles of any quadrangle inscribed in a circle are together equal
to two right angles.

III.32. If a chord of a circle be drawn from the point of contact of a tangent, the angle
made by the chord with the tangent is equal to the angle subtended by the chord at a point
on that part of the circumference which lies on the far side of the chord (e.g., in Figure
1.3c, ∠OTP′ = ∠TPP′).


We shall also have occasion to use two familiar theorems on similar tri- 
angles: 
VI.2. If a straight line be drawn parallel to one side of a triangle, it will cut the other
sides proportionately; and, if two sides of the triangle be cut proportionately, the line joining
the points of section will be parallel to the remaining side.

VI.4. If corresponding angles of two triangles are equal, then corresponding sides are
proportional.

Combining this last result with III.21 and 32, we deduce two significant
properties of secants of a circle (Figure 1.3c):


Figure 1.3c


III.35. If in a circle two straight lines cut each other, the rectangle contained by the
segments of the one is equal to the rectangle contained by the segments of the other (i.e.,
OP × OP′ = OQ × OQ′).

III.36. If from a point outside a circle a secant and a tangent be drawn, the rectangle
contained by the whole secant and the part outside the circle will be equal to the square
on the tangent (i.e., OP × OP′ = OT²).
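
Both properties say that the product OP × OP′ depends only on the point O and the circle, not on the direction of the secant. A minimal numerical sketch of this, assuming a hypothetical helper `secant_product` and an arbitrarily chosen unit circle (neither is from the text), might run as follows.

```python
import math


def secant_product(o, center, radius, angle):
    """Return OP * OP' for the line through o in direction `angle`.

    P and P' are the points where the line meets the circle.
    Returns None when the line misses the circle.
    """
    ux, uy = math.cos(angle), math.sin(angle)
    wx, wy = o[0] - center[0], o[1] - center[1]
    # Points o + t*u on the circle satisfy
    #   t^2 + 2*(w.u)*t + (|w|^2 - radius^2) = 0.
    half_b = wx * ux + wy * uy
    c = wx * wx + wy * wy - radius * radius
    disc = half_b * half_b - c
    if disc < 0:
        return None
    t1 = -half_b + math.sqrt(disc)
    t2 = -half_b - math.sqrt(disc)
    # |t1| and |t2| are the distances OP and OP'.
    return abs(t1 * t2)


# O outside the unit circle: every secant gives OT^2 = 3^2 - 1^2 (III.36).
outside = [secant_product((3.0, 0.0), (0.0, 0.0), 1.0, math.pi + d)
           for d in (-0.3, -0.15, 0.0, 0.15, 0.3)]

# O inside the circle: every chord through O gives the same product (III.35).
inside = [secant_product((0.5, 0.0), (0.0, 0.0), 1.0, k * 0.7)
          for k in range(5)]

print(outside)
print(inside)
```

The constancy is visible in the quadratic itself: the product of its two roots is the constant term, which does not involve the direction of the line.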


Book VI also contains an important property of area: 


VI.19. Similar triangles are to one another in the squared ratio of their corresponding
sides (i.e., if ABC and A′B′C′ are similar triangles, their areas are in the ratio AB² :
A′B′²).

This result yields the following easy proof for the theorem of Pythagoras 
[see Heath 1, p. 353; 2, pp. 210, 232, 269]:


I.47. In a right-angled triangle, the square on the hypotenuse is equal to the sum of
the squares on the two catheti. 


In the triangle ABC, right-angled at C, draw CF perpendicular to the 
hypotenuse AB, as in Figure 1.3d. Then we have three similar right-angled 
triangles ABC, ACF, CBF, with hypotenuses AB, AC, CB. By VI.19, the
areas satisfy

$$\frac{ABC}{AB^2} = \frac{ACF}{AC^2} = \frac{CBF}{CB^2}.$$

Evidently, ABC = ACF + CBF. Therefore AB² = AC² + CB².




Figure 1.3d 
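
The area argument above can be checked on a concrete right-angled triangle. The sketch below is an illustration only: the helper names `area`, `foot` and `sq` and the 3-4-5 triangle are assumptions made here. It computes the foot F of the perpendicular from C, then compares the three ratios of area to squared hypotenuse.

```python
def area(p, q, r):
    """Unsigned area of the triangle pqr (shoelace formula)."""
    return abs((q[0] - p[0]) * (r[1] - p[1])
               - (r[0] - p[0]) * (q[1] - p[1])) / 2


def foot(c, a, b):
    """Foot of the perpendicular from c to the line ab."""
    dx, dy = b[0] - a[0], b[1] - a[1]
    t = ((c[0] - a[0]) * dx + (c[1] - a[1]) * dy) / (dx * dx + dy * dy)
    return (a[0] + t * dx, a[1] + t * dy)


def sq(p, q):
    """Squared distance between p and q."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2


# Hypothetical sample: right angle at C, legs 3 and 4.
A, B, C = (3.0, 0.0), (0.0, 4.0), (0.0, 0.0)
F = foot(C, A, B)

# By VI.19 the three ratios area / hypotenuse^2 coincide.
ratios = (area(A, B, C) / sq(A, B),
          area(A, C, F) / sq(A, C),
          area(C, B, F) / sq(C, B))
print(ratios)

# The two small triangles fill the large one, so the squares add up.
print(area(A, C, F) + area(C, B, F), area(A, B, C))
print(sq(A, C) + sq(C, B), sq(A, B))
```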


EXERCISES 


1. Using rectangular Cartesian coordinates, show that the reflection in the y-axis 
(x = 0) reverses the sign of x. What happens when we reflect in the line x = y? 

2. Deduce I.47 from III.36 (applied to the circle with center A and radius AC). 

3. Inside a square ABDE, take a point C so that CDE is an isosceles triangle with 
angles 15° at D and E. What kind of triangle is ABC?

4. Prove the Erdős-Mordell theorem: If O is any point inside a triangle ABC and
P, Q, R are the feet of the perpendiculars from O upon the respective sides BC, CA,
AB, then


OA + OB + OC ≥ 2(OP + OQ + OR).


(Hint:* Let $P_1$ and $P_2$ be the feet of the perpendiculars from R and Q upon BC. Define
analogous points $Q_1$ and $Q_2$, $R_1$ and $R_2$ on the other sides. Using the similarity
of the triangles $PRP_1$ and OBR, express $P_1P$ in terms of RP, OR, and OB. After
substituting such expressions into

$$OA + OB + OC \ge \frac{OA\,(P_1P + PP_2)}{RQ} + \frac{OB\,(Q_1Q + QQ_2)}{PR} + \frac{OC\,(R_1R + RR_2)}{PQ},$$

collect the terms involving OP, OQ, OR, respectively.)
5. Under what circumstances can the sign ≥ in Ex. 4 be replaced by =?
6. In the notation of Ex. 4,
OA × OB × OC ≥ (OQ + OR)(OR + OP)(OP + OQ).


(A. Oppenheim, American Mathematical Monthly, 68 (1961), p. 230. See also L. J. 
Mordell, Mathematical Gazette, 46 (1962), pp. 213-215.) 


7. Prove the Steiner-Lehmus theorem: Any triangle having two equal internal angle
bisectors (each measured from a vertex to the opposite side) is isosceles. (Hint:† If a
triangle has two different angles, the smaller angle has the longer internal bisector.)


* Leon Bankoff, American Mathematical Monthly, 65 (1958), p. 521. For other proofs see G. R.
Veldkamp and H. Brabant, Nieuw Tijdschrift voor Wiskunde, 45 (1958), pp. 193-196; 46 (1959),
p. 87.

† Court 2, p. 72. For Lehmus’s proof of 1848, see Coxeter and Greitzer 1, p. 15.


1.4 THE MEDIANS AND THE CENTROID


Oriental mathematics may be an interesting curiosity, but Greek mathematics
is the real thing. ... The Greeks, as Littlewood said to me once,
are not clever schoolboys or “scholarship candidates,” but “Fellows of
another college.” So Greek mathematics is “permanent,” more permanent
even than Greek literature. Archimedes will be remembered
when Aeschylus is forgotten, because languages die and mathematical
ideas do not.


G. H. Hardy (1877-1947)
[Hardy 2, p. 21]


Figure 1.4a 


The line joining a vertex of a triangle to the midpoint of the opposite side 
is called a median. 

Let two of the three medians, say BB′ and CC′, meet in G (Figure 1.4a).
Let L and M be the midpoints of GB and GC. By Euclid VI.2 and 4 (which
were quoted on page 8), both C′B′ and LM are parallel to BC and half as
long. Therefore B′C′LM is a parallelogram. Since the diagonals of a parallelogram
bisect each other, we have


B′G = GL = LB, C′G = GM = MC.


Thus the two medians BB′, CC′ trisect each other at G. In other words,
this point G, which could have been defined as a point of trisection of one
median, is also a point of trisection of another, and similarly of the third.
We have thus proved [by the method of Court 1, p. 58] the following 
theorem: 


1.41 The three medians of any triangle all pass through one point. 


This common point G of the three medians is called the centroid of the 
triangle. Archimedes (c. 287-212 B.c.) obtained it as the center of gravity of 
a triangular plate of uniform density. 
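
As a rough numerical companion to Theorem 1.41, the sketch below takes, on each median, the point two thirds of the way from the vertex to the opposite midpoint and checks that the three points coincide. The names `midpoint`, `along` and `trisection_points` and the sample triangle are assumptions made here for illustration.

```python
def midpoint(p, q):
    """Midpoint of the segment pq."""
    return ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)


def along(p, q, t):
    """Point p + t*(q - p), which divides pq in the ratio t : (1 - t)."""
    return (p[0] + t * (q[0] - p[0]), p[1] + t * (q[1] - p[1]))


def trisection_points(a, b, c):
    """On each median, the point two thirds of the way from the vertex."""
    return [
        along(a, midpoint(b, c), 2 / 3),
        along(b, midpoint(c, a), 2 / 3),
        along(c, midpoint(a, b), 2 / 3),
    ]


# Hypothetical sample triangle.
A, B, C = (0.0, 0.0), (6.0, 0.0), (1.5, 4.5)
points = trisection_points(A, B, C)

# The common point is also the mean of the three vertices, in keeping
# with its role as a center of gravity.
mean = ((A[0] + B[0] + C[0]) / 3, (A[1] + B[1] + C[1]) / 3)
print(points, mean)
```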




EXERCISES

1. Any triangle having two equal medians is isosceles.*

2. The sum of the medians of a triangle lies between ¾p and p, where p is the sum
of the sides. [Court 1, pp. 60-61.]


1.5 THE INCIRCLE AND THE CIRCUMCIRCLE 


Alone at nights, 


I read my Bible more and Euclid less.


Robert Buchanan (1841 -1901) 
(An Old Dominie's Story) 


Figure 1.5a 


Euclid III.3 tells us that a circle is symmetrical by reflection in any diame- 
ter (whereas an ellipse is merely symmetrical about two special diameters: 
the major and minor axes). It follows that the angle between two inter- 
secting tangents is bisected by the diameter through their common point. 


*It is to be understood that any exercise appearing in the form of a theorem is intended to be
proved. It saves space to omit the words “Prove that” or “Show that.”

By considering the loci of points equidistant from pairs of sides of a triangle
ABC, we see that the internal and external bisectors of the three
angles of the triangle meet by threes in four points $I$, $I_a$, $I_b$, $I_c$, as in Figure
1.5a. These points are the centers of the four circles that can be drawn to
touch the three lines BC, CA, AB. One of them, the incenter $I$, being inside
the triangle, is the center of the inscribed circle or incircle (Euclid IV.4).
The other three are the excenters $I_a$, $I_b$, $I_c$: the centers of the three escribed
circles or excircles [Court 2, pp. 72-88]. The radii of the incircle and excircles
are the inradius $r$ and the exradii $r_a$, $r_b$, $r_c$.
In describing a triangle ABC, it is customary to call the sides

$$a = BC, \quad b = CA, \quad c = AB,$$

the semiperimeter

$$s = \tfrac{1}{2}(a + b + c),$$

the angles A, B, C, and the area $\Delta$.
Since A + B + C = 180°, we have

1.51    $$\angle BIC = 90^\circ + \tfrac{1}{2}A,$$

a result which we shall find useful in § 1.9.
Since IBC is a triangle with base $a$ and height $r$, its area is $\tfrac{1}{2}ar$. Adding
three such triangles we deduce

$$\Delta = \tfrac{1}{2}(a + b + c)r = sr.$$

Similarly $\Delta = \tfrac{1}{2}(b + c - a)r_a = (s - a)r_a$. Thus

1.52    $$\Delta = sr = (s - a)r_a = (s - b)r_b = (s - c)r_c.$$

From the well-known formula $\cos A = (b^2 + c^2 - a^2)/2bc$, we find also

$$\sin A = [-a^4 - b^4 - c^4 + 2b^2c^2 + 2c^2a^2 + 2a^2b^2]^{1/2}/2bc,$$

whence

1.53    $$\begin{aligned}
\Delta &= \tfrac{1}{2}bc\sin A \\
&= \tfrac{1}{4}[-a^4 - b^4 - c^4 + 2b^2c^2 + 2c^2a^2 + 2a^2b^2]^{1/2} \\
&= \tfrac{1}{4}[(a + b + c)(-a + b + c)(a - b + c)(a + b - c)]^{1/2} \\
&= [s(s - a)(s - b)(s - c)]^{1/2}.
\end{aligned}$$

This remarkable expression, which we shall use in § 18.4, is attributed to
Heron of Alexandria (about 60 A.D.), but it was really discovered by Archimedes.
(See B. L. van der Waerden, Science Awakening, Oxford University
Press, New York, 1961, pp. 228, 277.) Combining Heron’s formula with
1.52, we obtain

1.531    $$r^2 = \left(\frac{\Delta}{s}\right)^2 = \frac{(s - a)(s - b)(s - c)}{s}, \qquad r_a^2 = \left(\frac{\Delta}{s - a}\right)^2 = \frac{s(s - b)(s - c)}{s - a}.$$
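
To see these formulas at work, here is a small sketch that evaluates the area in two of the forms given above and derives the inradius and exradii from 1.52. The function names and the choice of the 13, 14, 15 triangle are assumptions made for illustration only.

```python
import math


def heron_area(a, b, c):
    """Area from Heron's formula [s(s-a)(s-b)(s-c)]^(1/2)."""
    s = (a + b + c) / 2
    return math.sqrt(s * (s - a) * (s - b) * (s - c))


def quartic_area(a, b, c):
    """Area from the quartic expression in a, b, c that precedes Heron's form."""
    q = (-a**4 - b**4 - c**4
         + 2 * b * b * c * c + 2 * c * c * a * a + 2 * a * a * b * b)
    return math.sqrt(q) / 4


def in_and_exradii(a, b, c):
    """Return (r, r_a, r_b, r_c) using area = sr = (s-a)r_a = (s-b)r_b = (s-c)r_c."""
    s = (a + b + c) / 2
    area = heron_area(a, b, c)
    return area / s, area / (s - a), area / (s - b), area / (s - c)


# Hypothetical sample triangle with sides 13, 14, 15 (so s = 21).
a, b, c = 13, 14, 15
s = (a + b + c) / 2
r, r_a, r_b, r_c = in_and_exradii(a, b, c)

print(heron_area(a, b, c), quartic_area(a, b, c))
print(r, r_a, r_b, r_c)
# Squared radii, compared with the closed forms obtained from Heron's formula.
print(r * r, (s - a) * (s - b) * (s - c) / s)
print(r_a * r_a, s * (s - b) * (s - c) / (s - a))
```

Both area functions should return the same value for any valid triangle; agreement on one sample is of course a sanity check, not a proof.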


Another consequence of the symmetry of a circle is that the perpendicular 
bisectors of the three sides of a triangle all pass through the circumcenter O, 


Figure 1.5b          Figure 1.5c


which is the center of the circumscribed circle or circumcircle (Euclid IV.5). 
This is the only circle that can be drawn through the three vertices A, B, C. 
Its radius R is called the circumradius of the triangle. Since the “angle at the
center,” ∠BOC (Figure 1.5b), is double the angle A, the congruent right-angled
triangles OBA′, OCA′ each have an angle A at O, whence

$$R \sin A = BA' = \tfrac{1}{2}a,$$

1.54    $$\frac{a}{\sin A} = \frac{b}{\sin B} = \frac{c}{\sin C} = 2R.$$


Draw AD perpendicular to BC, and let AK be the diameter through A
of the circumcircle, as in Figure 1.5c. By Euclid III.21, the right-angled triangles
ABD and AKC are similar; therefore

$$\frac{AD}{AB} = \frac{AC}{AK}, \qquad AD = \frac{bc}{2R}.$$

Since $\Delta = \tfrac{1}{2}BC \times AD$, it follows that

1.55    $$\begin{aligned}
4\Delta R = abc &= s(s - b)(s - c) + s(s - c)(s - a) + s(s - a)(s - b) - (s - a)(s - b)(s - c) \\
&= \frac{\Delta^2}{s - a} + \frac{\Delta^2}{s - b} + \frac{\Delta^2}{s - c} - \frac{\Delta^2}{s} \\
&= \Delta(r_a + r_b + r_c - r).
\end{aligned}$$


Hence the five radii are connected by the formula 


1.56    $$4R = r_a + r_b + r_c - r.$$
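
The relations 1.54 to 1.56 lend themselves to a quick numerical check. In the sketch below, the function name `circumradius_checks` and the sample sides are assumptions made here. The circumradius is computed from 4ΔR = abc and compared with the law of sines and with the formula connecting the five radii.

```python
import math


def circumradius_checks(a, b, c):
    """Return (R, a / sin A, r_a + r_b + r_c - r) for a triangle with sides a, b, c."""
    s = (a + b + c) / 2
    area = math.sqrt(s * (s - a) * (s - b) * (s - c))
    # 1.55: 4 * area * R = a * b * c.
    big_r = a * b * c / (4 * area)
    # 1.54: a / sin A should equal 2R; sin A is positive for a triangle angle.
    cos_a = (b * b + c * c - a * a) / (2 * b * c)
    sin_a = math.sqrt(1 - cos_a * cos_a)
    sine_ratio = a / sin_a
    # 1.56: r_a + r_b + r_c - r should equal 4R.
    r = area / s
    radii_sum = area / (s - a) + area / (s - b) + area / (s - c) - r
    return big_r, sine_ratio, radii_sum


# Hypothetical sample triangle with sides 13, 14, 15.
R, sine_ratio, radii_sum = circumradius_checks(13, 14, 15)
print(R, sine_ratio / 2, radii_sum / 4)
```

All three printed numbers should agree up to rounding.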


Let us now consider four circles $E_1$, $E_2$, $E_3$, $E_4$, tangent to one another at
six distinct points. Each circle $E_i$ has a bend $\varepsilon_i$, defined as the reciprocal of
its radius with a suitable sign attached, namely, if all the contacts are external
(as in the case of the light circles in Figure 1.5d), the bends are all
positive, but if one circle surrounds the other three (as in the case of the
heavy circles) the bend of this largest circle is taken to be negative; and a line
counts as a circle of bend 0. In any case, the sum of all four bends is positive.

Figure 1.5d

In a letter of November 1643 to Princess Elisabeth of Bohemia, René 
Descartes developed a formula relating the radii of four mutually tangent 
circles. In the “bend” notation it is


1.57    $$2(\varepsilon_1^2 + \varepsilon_2^2 + \varepsilon_3^2 + \varepsilon_4^2) = (\varepsilon_1 + \varepsilon_2 + \varepsilon_3 + \varepsilon_4)^2.$$


Figure 1.5e Figure 1.5f 


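This Descartes circle theorem was rediscovered in 1842 by an English amateur,
Philip Beecroft, who observed that the four circles $E_i$ determine another
set of four circles $H_i$, mutually tangent at the same six points: $H_1$ through
the three points of contact of $E_2$, $E_3$, $E_4$, and so on. Let $\eta_i$ denote the bend
of $H_i$. If the centers of $E_1$, $E_2$, $E_3$ form a triangle ABC, $H_4$ is either the
incircle or an excircle. In the former case (Figure 1.5e),

1.58    $$\varepsilon_1 = \frac{1}{s - a}, \quad \varepsilon_2 = \frac{1}{s - b}, \quad \varepsilon_3 = \frac{1}{s - c}, \quad \eta_4 = \frac{1}{r}.$$

In the latter (Figure 1.5f),

$$\varepsilon_1 = -\frac{1}{s}, \quad \varepsilon_2 = \frac{1}{s - c}, \quad \varepsilon_3 = \frac{1}{s - b}, \quad \eta_4 = \frac{1}{r_a}.$$

In either case, we see from 1.531 that

$$\varepsilon_2\varepsilon_3 + \varepsilon_3\varepsilon_1 + \varepsilon_1\varepsilon_2 = \left(\frac{1}{\varepsilon_1} + \frac{1}{\varepsilon_2} + \frac{1}{\varepsilon_3}\right)\varepsilon_1\varepsilon_2\varepsilon_3 = \eta_4^2.$$

Similarly $\eta_2\eta_3 + \eta_3\eta_1 + \eta_1\eta_2 = \varepsilon_4^2$, and of course we can permute the
subscripts 1, 2, 3, 4. Hence

$$(\Sigma\varepsilon_i)^2 = \varepsilon_1^2 + \cdots + \varepsilon_4^2 + 2\varepsilon_1\varepsilon_2 + \cdots + 2\varepsilon_3\varepsilon_4 = \Sigma\varepsilon_i^2 + \Sigma\eta_i^2.$$

Since this expression involves $\varepsilon_i$ and $\eta_i$ symmetrically, it is also equal to
$(\Sigma\eta_i)^2$; thus

$$\varepsilon_1 + \varepsilon_2 + \varepsilon_3 + \varepsilon_4 = \eta_1 + \eta_2 + \eta_3 + \eta_4 > 0.$$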


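Also, since

$$\begin{aligned}
(\varepsilon_1 + \varepsilon_2 + \varepsilon_3 - \varepsilon_4)(\varepsilon_1 + \varepsilon_2 + \varepsilon_3 + \varepsilon_4)
&= (\varepsilon_1 + \varepsilon_2 + \varepsilon_3)^2 - \varepsilon_4^2 \\
&= \varepsilon_1^2 + \varepsilon_2^2 + \varepsilon_3^2 - \varepsilon_4^2 + 2\eta_4^2 \\
&= (\eta_2\eta_3 + \eta_2\eta_4 + \eta_3\eta_4) + (\eta_1\eta_3 + \cdots) + (\eta_1\eta_2 + \cdots) - (\eta_1\eta_2 + \cdots) + 2\eta_4^2 \\
&= 2(\eta_1\eta_4 + \eta_2\eta_4 + \eta_3\eta_4) + 2\eta_4^2 = 2\eta_4(\eta_1 + \eta_2 + \eta_3 + \eta_4),
\end{aligned}$$

1.59    $$\varepsilon_1 + \varepsilon_2 + \varepsilon_3 - \varepsilon_4 = 2\eta_4.$$

Adding four such equations after squaring each side, we deduce $\Sigma\varepsilon_i^2 = \Sigma\eta_i^2$,
whence

$$2\Sigma\varepsilon_i^2 = \Sigma\varepsilon_i^2 + \Sigma\eta_i^2 = (\Sigma\varepsilon_i)^2.$$

Thus 1.57 has been proved.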
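
A short computational sketch may make the theorem more tangible. Combining 1.59 with the relation between the product sum of the first three bends and the square of $\eta_4$, the bend of a fourth circle tangent to three given ones can be written as the sum of the three bends plus or minus twice the square root of that product sum. The function names below and the sample bends are assumptions made here; the code also assumes the product sum is non-negative, as it is for a genuine configuration.

```python
import math


def fourth_bends(e1, e2, e3):
    """Return the two possible bends of a circle tangent to three mutually
    tangent circles with bends e1, e2, e3."""
    # eta4 is the bend of the circle through the three points of contact.
    eta4 = math.sqrt(e2 * e3 + e3 * e1 + e1 * e2)
    total = e1 + e2 + e3
    return total + 2 * eta4, total - 2 * eta4


def descartes_defect(e1, e2, e3, e4):
    """Left side minus right side of the circle theorem; zero when it holds."""
    return (2 * (e1**2 + e2**2 + e3**2 + e4**2)
            - (e1 + e2 + e3 + e4) ** 2)


# Hypothetical sample: circles of bends 2, 2, 3.
inner, outer = fourth_bends(2, 2, 3)
print(inner, outer)
print(descartes_defect(2, 2, 3, inner), descartes_defect(2, 2, 3, outer))

# Three equal circles of bend 1: one small circle between them and one
# surrounding circle, whose bend comes out negative.
small, surrounding = fourth_bends(1, 1, 1)
print(small, surrounding)
```

A negative result corresponds to a circle that surrounds the other three, in line with the sign convention for bends.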


In 1936, this theorem was rediscovered again by Sir Frederick Soddy, who 
had received a Nobel prize in 1921 for his discovery of isotopes. He ex- 
pressed the theorem in the form of a poem, The Kiss Precise*, of which the 
middle verse runs as follows: 


Four circles to the kissing come,
The smaller are the benter.
The bend is just the inverse of
The distance from the centre.
Though their intrigue left Euclid dumb
There’s now no need for rule of thumb.
Since zero bend’s a dead straight line
And concave bends have minus sign,
The sum of the squares of all four bends
Is half the square of their sum.


* Nature, 137 (1936), p. 1021; 139 (1937), p. 62. In the next verse, Soddy announced his discovery
of the analogous formula for 5 spheres in 3 dimensions. A final verse, added by Thorold
Gosset (1869-1962) deals with n + 2 spheres in n dimensions; see Coxeter, Aequationes Mathematicae,
1 (1968), pp. 104-121.


EXERCISES 


1. Find the locus of the image of a fixed point P by reflection in a variable line
through another fixed point O. 
2. For the general triangle ABC, establish the identities

$$\frac{1}{r_a} + \frac{1}{r_b} + \frac{1}{r_c} = \frac{1}{r}, \qquad r\,r_a r_b r_c = \Delta^2.$$

3. The lengths of the tangents from the vertex A to the incircle and to the three
excircles are respectively

$$s - a, \quad s, \quad s - c, \quad s - b.$$


4. The circumcenter of an obtuse-angled triangle lies outside the triangle. 
5. Where is the circumcenter of a right-angled triangle? 


6. Let U, V, W be three points on the respective sides BC, CA, AB of a triangle 
ABC. The perpendiculars to the sides at these points are concurrent if and only if 


$$AW^2 + BU^2 + CV^2 = WB^2 + UC^2 + VA^2.$$


7. A triangle is right-angled if and only if r + 2R = s. 
8. The bends of Beecroft’s eight circles satisfy 


$$\varepsilon_1 + \eta_1 = \varepsilon_2 + \eta_2 = \varepsilon_3 + \eta_3 = \varepsilon_4 + \eta_4, \qquad \sum \varepsilon_i \eta_i = 0.$$


9. For any four numbers satisfying k + l + m + n = 0, there is a “Beecroft con-
figuration” having bends 


$$\varepsilon_1 = k(k + l), \quad \varepsilon_2 = (k + l)l, \quad \varepsilon_3 = n^2 - kl, \quad \varepsilon_4 = m^2 - kl,$$
$$\eta_1 = l^2 - mn, \quad \eta_2 = k^2 - mn, \quad \eta_3 = m(m + n), \quad \eta_4 = (m + n)n.$$


(Hint: Express ε₃, ε₄, η₁, η₂ as rational functions of ε₁, ε₂, η₃, η₄.) 


10. If three circles, externally tangent to one another, have centers forming a tri-
angle ABC, they are all tangent to two other circles (or possibly a circle and a line) 
whose bends are 

$$\frac{r + 4R \pm 2s}{\Delta}.$$

11. Given a point P on the circumcircle of a triangle, the feet of the perpendiculars 
from P to the three sides all lie on a straight line. (This line is commonly called the Sim- 
son line of P with respect to the triangle, although it was first mentioned by W. Wallace, 
thirty years after Simson’s death [Johnson 1, p. 138].) 

12. Given a triangle ABC and a point P in its plane (but not on a side nor on the cir-
cumcircle), let A₁B₁C₁ be the derived triangle formed by the feet of the perpendiculars 
from P to the sides BC, CA, AB. Let A₂B₂C₂ be derived analogously from A₁B₁C₁ 
(using the same P), and A₃B₃C₃ from A₂B₂C₂. Then A₃B₃C₃ is directly similar to ABC. 
[Casey 1, p. 253.] (Hint: ∠PBA = ∠PA₁C₁ = ∠PC₂B₂ = ∠PB₃A₃.) This result has 
been extended by B. M. Stewart from the third derived triangle of a triangle to the nth 
derived n-gon of an n-gon. (American Mathematical Monthly 47 (1940), pp. 462-466). 


1.6 THE EULER LINE AND THE ORTHOCENTER 


Although the Greeks worked fruitfully, not only in geometry but also in 
the most varied fields of mathematics, nevertheless we today have gone 
beyond them everywhere and certainly also in geometry. 

F. Klein (1849-1925) 
[Klein 2, p. 189] 


From now on, we shall have various occasions to mention the name of L. 
Euler (1707-1783), a Swiss who spent most of his life in Russia, making im- 
portant contributions to all branches of mathematics. Some of his simplest 
discoveries are of such a nature that one can well imagine the ghost of Euclid 
saying, “Why on earth didn’t I think of that?” 


Figure 1.6a 


If the circumcenter O and centroid G of a triangle coincide, each median 
is perpendicular to the side that it bisects, and the triangle is “isosceles three 
ways,” that is, equilateral. Hence, if a triangle ABC is not equilateral, its 
circumcenter and centroid lie on a unique line OG. On this so-called Euler 
line, consider a point H such that OH = 30G, that is, GH = 20G (Figure 
1.6a). Since also GA = 2A’G, the latter half of Euclid VI.2 tells us that AH 
is parallel to A’O, which is the perpendicular bisector of BC. Thus AH is 
perpendicular to BC. Similarly BH is perpendicular to CA, and CH to AB. 

The line through a vertex perpendicular to the opposite side is called an 
altitude. The above remarks [cf. Court 2, p. 101] show that 


The three altitudes of any triangle all pass through one point on the Euler line. 
This common point H of the three altitudes is called the orthocenter of the 
triangle. 
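
The construction of H can be checked on a particular triangle. The following sketch uses exact rational arithmetic and assumes Cartesian coordinates for an example triangle (neither appears in the text). It places H on the line OG with OH = 3OG and then tests that AH, BH, CH are perpendicular to the opposite sides. The helper names are hypothetical.

```python
from fractions import Fraction as Fr

def circumcenter(a, b, c):
    """Point equidistant from a, b, c (exact when the coordinates are Fractions)."""
    (ax, ay), (bx, by), (cx, cy) = a, b, c
    d = 2 * (ax * (by - cy) + bx * (cy - ay) + cx * (ay - by))
    sa, sb, sc = ax * ax + ay * ay, bx * bx + by * by, cx * cx + cy * cy
    return ((sa * (by - cy) + sb * (cy - ay) + sc * (ay - by)) / d,
            (sa * (cx - bx) + sb * (ax - cx) + sc * (bx - ax)) / d)

def centroid(a, b, c):
    return ((a[0] + b[0] + c[0]) / 3, (a[1] + b[1] + c[1]) / 3)

def point_h(a, b, c):
    """The point H on the line OG with OH = 3 OG, as described in the text."""
    o, g = circumcenter(a, b, c), centroid(a, b, c)
    return (o[0] + 3 * (g[0] - o[0]), o[1] + 3 * (g[1] - o[1]))

def dot_of_differences(p, q, r, s):
    """Scalar product of the vectors pq and rs; zero means perpendicular."""
    return (q[0] - p[0]) * (s[0] - r[0]) + (q[1] - p[1]) * (s[1] - r[1])

# Assumed example triangle (not equilateral).
A, B, C = (Fr(0), Fr(0)), (Fr(4), Fr(0)), (Fr(1), Fr(3))
H = point_h(A, B, C)
print(H)
print(dot_of_differences(A, H, B, C),
      dot_of_differences(B, H, C, A),
      dot_of_differences(C, H, A, B))
```

All three scalar products vanish for this triangle, as the argument above predicts for any non-equilateral triangle.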
EXERCISES 


1. Through each vertex of a given triangle ABC draw a line parallel to the opposite 
side. The perpendicular bisectors of the sides of the triangle so formed suggest an alter- 
native proof that the three altitudes of ABC are concurrent. 


2. The orthocenter of an obtuse-angled triangle lies outside the triangle. 
3. Where is the orthocenter of a right-angled triangle? 
4. Any triangle having two equal altitudes is isosceles. 


5. Construct an isosceles triangle ABC (with base BC), given the median BB’ and 
the altitude BE. (Hint: The centroid is two-thirds of the way from B to B’.) 
(H. Freudenthal.) 

6. The altitude AD of any triangle ABC is of length 


2R sin B sin C. 


7. Find the perpendicular distance from the centroid G to the side BC. 


8. If the Euler line passes through a vertex, the triangle is either right-angled or 
isosceles (or both). 


9. If the Euler line is parallel to the side BC, the angles B and C satisfy 
tan B tan C = 3. 


1.7. THE NINE-POINT CIRCLE 


This circle is the first really exciting one to appear in any course on 
elementary geometry. 

Daniel Pedoe (1910 - ) 

[Pedoe 1, p. 1] 


Figure 1.7a 


The feet of the altitudes (that is, three points like D in Figure 1.6a) form 
the orthic triangle (or “pedal triangle”) of ABC. The circumcircle of the 
orthic triangle is called the nine-point circle (or “Feuerbach circle”) of the 
original triangle, because it contains not only the feet of the three altitudes 
but also six other significant points. In fact, 


1.71. The midpoints of the three sides, the midpoints of the lines joining the 
orthocenter to the three vertices, and the feet of the three altitudes, all lie on a 
circle. 

Proof [Coxeter 2, 9.29]. Let A’, B’, C’, A”, B”, C” be the midpoints of BC, 
CA, AB, HA, HB, HC, and let D, E, F be the feet of the altitudes, as in Figure 
1.7a. By Euclid VI.2 and 4 again, both C’B’ and B”C” are parallel to 
BC while both B’C” and C’B” are parallel to AH. Since AH is perpendicular 
to BC, it follows that B’C’B”C” is a rectangle. Similarly C’A’C”A” is a 
rectangle. Hence A’A”, B’B”, C’C” are three diameters of a circle. Since 
these diameters subtend right angles at D, E, F, respectively, the same circle 
passes through these points too. 
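
The theorem is easy to test on a particular triangle. In the following sketch (exact rational arithmetic, with an assumed example triangle and hypothetical helper names), the orthocenter is obtained from the relation OH = 3OG of § 1.6, which in coordinates reads H = A + B + C − 2O. The center of the circle is taken, as in the proof, to be the midpoint of the diameter A’A”.

```python
from fractions import Fraction as Fr

def circumcenter(a, b, c):
    (ax, ay), (bx, by), (cx, cy) = a, b, c
    d = 2 * (ax * (by - cy) + bx * (cy - ay) + cx * (ay - by))
    sa, sb, sc = ax * ax + ay * ay, bx * bx + by * by, cx * cx + cy * cy
    return ((sa * (by - cy) + sb * (cy - ay) + sc * (ay - by)) / d,
            (sa * (cx - bx) + sb * (ax - cx) + sc * (bx - ax)) / d)

def midpoint(p, q):
    return ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)

def foot(p, q, r):
    """Foot of the perpendicular from p to the line qr."""
    dx, dy = r[0] - q[0], r[1] - q[1]
    t = ((p[0] - q[0]) * dx + (p[1] - q[1]) * dy) / (dx * dx + dy * dy)
    return (q[0] + t * dx, q[1] + t * dy)

def dist2(p, q):
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2

def nine_points(a, b, c):
    """Return the nine points of 1.71 and the midpoint of A'A''."""
    o = circumcenter(a, b, c)
    h = (a[0] + b[0] + c[0] - 2 * o[0], a[1] + b[1] + c[1] - 2 * o[1])
    side_mids = [midpoint(b, c), midpoint(c, a), midpoint(a, b)]   # A', B', C'
    h_mids = [midpoint(h, a), midpoint(h, b), midpoint(h, c)]      # A'', B'', C''
    feet = [foot(a, b, c), foot(b, c, a), foot(c, a, b)]           # D, E, F
    return side_mids + h_mids + feet, midpoint(side_mids[0], h_mids[0])

A, B, C = (Fr(0), Fr(0)), (Fr(4), Fr(0)), (Fr(1), Fr(3))   # assumed example
points, center = nine_points(A, B, C)
print(center, {dist2(p, center) for p in points})
```

The printed set contains a single squared radius, so all nine points of this triangle are equidistant from the center.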


If four points in a plane are joined in pairs by six distinct lines, they are 
called the vertices of a complete quadrangle, and the lines are its six sides. 
Two sides are said to be opposite if they have no common vertex. Any point 
of intersection of two opposite sides is called a diagonal point. There may be 
as many as three such points (see Figure 1.7b). 




Figure 1.7b 


If a triangle ABC is not right-angled, its vertices and orthocenter form a 
special kind of quadrangle whose opposite sides are perpendicular. In this 
terminology, the concurrence of the three altitudes can be expressed as fol- 
lows: 


1.72 If two pairs of opposite sides of a complete quadrangle are pairs of 
perpendicular lines, the remaining sides are likewise perpendicular. 


Such a quadrangle ABCH is called an orthocentric quadrangle. Its six sides 

BC, CA, AB, HA, HB, HC 

are the sides and altitudes of the triangle ABC, and its diagonal points D, E, 
F are the feet of the altitudes. Among the four vertices of the quadrangle, 
our notation seems to give a special role to the vertex H. Clearly, however, 


1.73 Each vertex of an orthocentric quadrangle is the orthocenter of the 
triangle formed by the remaining three vertices. 


The four triangles (just one of which is acute-angled) all have the same 
orthic triangle and consequently the same nine-point circle. 

It is proved in books on affine geometry [such as Coxeter 2, 8.71] that the 
midpoints of the six sides of any complete quadrangle and the three diagonal 
points all lie on a conic. The above remarks show that, when the quadrangle 
is orthocentric, this “nine-point conic” reduces to a circle. 


EXERCISES 

1. Of the nine points described in 1.71, how many coincide when the triangle is 
(a) isosceles, (b) equilateral? 

2. The feet of the altitudes decompose the nine-point circle into three arcs. If the 
triangle is scalene, the remaining six of the nine points are distributed among the three 
arcs as follows: One arc contains just one of the six points, another contains two, and 
the third contains three. 

3. On the arc A’D of the nine-point circle, take the point X one-third of the way 
from A’ to D. Take points Y, Z similarly, on the arcs B’E, C’F. Then XYZ is an 
equilateral triangle. 

4. The incenter and the excenters of any triangle form an orthocentric quadrangle. 
[Casey 1, p. 274.] 

5. In the notation of § 1.5, the Euler line of I_aI_bI_c is IO. 

6. The four triangles that occur in an orthocentric quadrangle have equal circum- 
radii. 


1.8 TWO EXTREMUM PROBLEMS 


Most people have some appreciation of mathematics, just as most 
people can enjoy a pleasant tune; and there are probably more people 
really interested in mathematics than in music. 

G. H. Hardy [2, p. 26] 


Their interest will be stimulated if only we can eliminate the aversion 
toward mathematics that so many have acquired from childhood 
experiences. 

Hans Rademacher (1892 - ) 
[Rademacher and Toeplitz 1, p. 5] 


We shall describe the problems of Fagnano and Fermat in considerable 
detail because of the interesting methods used in solving them. The first was 
proposed in 1775 by J. F. Toschi di Fagnano, who solved it by means of 
differential calculus. The method given here was discovered by L. Fejér 
while he was a student [Rademacher and Toeplitz 1, pp. 30-32]. 


FAGNANO'S PROBLEM. In a given acute-angled triangle ABC, inscribe a 
triangle UVW whose perimeter is as small as possible. 

Consider first an arbitrary triangle UVW with U on BC, V on CA, W on 
AB. Let U’, U” be the images of U by reflection in CA, AB, respectively. 
Then 

$$UV + VW + WU = U'V + VW + WU'',$$

which is a path from U’ to U”, usually a broken line with angles at V and W. 
Such a path from U’ to U” is minimal when it is straight, as in Figure 1.8a. 
Hence, among all inscribed triangles with a given vertex U on BC, the one 
with smallest perimeter occurs when V and W lie on the straight line U’U”. 
In this way we obtain a definite triangle UVW for each choice of U on BC. 
The problem will be solved when we have chosen U so as to minimize U’U”, 
which is equal to the perimeter. 


Figure 1.8a 


Since AU’ and AU” are images of AU by reflection in AC and AB, they 
are congruent and 

$$\angle U'AU'' = 2A.$$

Thus AU’U” is an isosceles triangle whose angle at A is independent of the 
choice of U. The base U’U” is minimal when the equal sides are minimal, 
that is, when AU is minimal. In other words, AU is the shortest distance 
from the given point A to the given line BC. Since the hypotenuse of 
a right-angled triangle is longer than either cathetus, the desired location of 
U is such that AU is perpendicular to BC. Thus AU is the altitude from A. 

This choice of U yields a unique triangle UVW whose perimeter is smaller 
than that of any other inscribed triangle. Since we could equally well have 
begun with B or C instead of A, we see that BV and CW are the altitudes 
from B and C. Hence 

The triangle of minimal perimeter inscribed in an acute-angled triangle ABC 
is the orthic triangle of ABC. 
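
A numerical experiment illustrates the result. It is only a sketch on an assumed example triangle, with hypothetical helper names. Since U’U” is the base of an isosceles triangle with equal sides AU and apex angle 2A, its length is 2 · AU · sin A; with AU the altitude from A this should equal the perimeter of the orthic triangle. The code compares the two values, and also compares the orthic perimeter with a coarse sample of other inscribed triangles.

```python
from itertools import product
from math import dist

def foot(p, q, r):
    """Foot of the perpendicular from p to the line qr."""
    dx, dy = r[0] - q[0], r[1] - q[1]
    t = ((p[0] - q[0]) * dx + (p[1] - q[1]) * dy) / (dx * dx + dy * dy)
    return (q[0] + t * dx, q[1] + t * dy)

def along(p, q, t):
    """Point dividing the segment pq in the ratio t : (1 - t)."""
    return (p[0] + t * (q[0] - p[0]), p[1] + t * (q[1] - p[1]))

def perimeter(u, v, w):
    return dist(u, v) + dist(v, w) + dist(w, u)

A, B, C = (0.0, 0.0), (4.0, 0.0), (1.0, 3.0)   # an acute-angled triangle (assumed)

orthic = perimeter(foot(A, B, C), foot(B, C, A), foot(C, A, B))

# sin A from the cross product of AB and AC; then 2 * (altitude from A) * sin A.
cross = (B[0] - A[0]) * (C[1] - A[1]) - (B[1] - A[1]) * (C[0] - A[0])
sin_a = abs(cross) / (dist(A, B) * dist(A, C))
predicted = 2 * dist(A, foot(A, B, C)) * sin_a

# Coarse grid of inscribed triangles UVW with U on BC, V on CA, W on AB.
steps = [i / 20 for i in range(1, 20)]
best_sampled = min(perimeter(along(B, C, t), along(C, A, u), along(A, B, v))
                   for t, u, v in product(steps, repeat=3))
print(orthic, predicted, best_sampled)
```

A grid search proves nothing, of course, but no sampled triangle has a smaller perimeter than the orthic triangle.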

The same method can be used to prove the analogous result for spherical 
triangles [Steiner 2, p. 45, No. 7]. 

The other problem, proposed by Pierre Fermat (1601-1665), likewise seeks 
to minimize the sum of three distances. The solution given here is due to J. 
E. Hofmann.* 

FERMAT’S PROBLEM. In a given acute-angled triangle ABC, locate a point 
P whose distances from A, B, C have the smallest possible sum. 

Consider first an arbitrary point P inside the triangle. Join it to A, B, C 
and rotate the inner triangle APB through 60° about B to obtain C’P’B, so 
that ABC’ and PBP’ are equilateral triangles, as in Figure 1.8b. Then 

$$AP + BP + CP = C'P' + P'P + PC,$$

which is a path from C’ to C, usually a broken line with angles at P’ and P. 
Such a path (joining C’ to C by a sequence of three segments) is minimal 
when it is straight, in which case 

$$\angle BPC = 180^\circ - \angle BPP' = 120^\circ$$

and 

$$\angle APB = \angle C'P'B = 180^\circ - \angle PP'B = 120^\circ.$$


Figure 1.8b 


* Elementare Lösung einer Minimumsaufgabe, Zeitschrift für mathematischen und naturwis-
senschaftlichen Unterricht, 60 (1929), pp. 22-23. 

Thus the desired point P, for which AP + BP + CP is minimal, is the 
point from which each of the sides BC, CA, AB subtends an angle of 120°. 
This “Fermat point” is most simply constructed as the second intersection of 
the line CC’ and the circle ABC’ (that is, the circumcircle of the equilateral 
triangle ABC’). 

It has been pointed out [for example by Pedoe 1, pp. 11-12] that the tri- 
angle ABC need not be assumed to be acute-angled. The above solution is 
valid whenever there is no angle greater than 120°. 

Instead of the equilateral triangle ABC’ on AB, we could just as well have 
drawn an equilateral triangle BCA’ on BC, or CAB’ on CA, as in Figure 1.8c. 
Thus the three lines AA’, BB’, CC’ all pass through the Fermat point P, and 
any two of them provide an alternative construction for it. Moreover, the 
line segments AA’, BB’, CC’ are all equal to AP + BP + CP. Hence 


If equilateral triangles BCA’, CAB’, ABC’ are drawn outwards on the sides 
of any triangle ABC, the line segments AA’, BB’, CC’ are equal, concurrent, 
and inclined at 60° to one another. 
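
The statement can be tested numerically with complex numbers standing for points. This is a sketch under assumptions not made in the text: the example triangle is lettered counterclockwise, so that "outwards" means to the right of each directed side, and the names A1, B1, C1 stand for A’, B’, C’. The Fermat point is found as the intersection of AA’ and CC’.

```python
import cmath
from math import pi

TURN = cmath.exp(-1j * pi / 3)          # rotation through 60 degrees, clockwise

def outward_apex(p, q):
    """Apex of the equilateral triangle on pq lying to the right of p -> q."""
    return p + (q - p) * TURN

def cross(u, v):
    """Two-dimensional cross product of the vectors u and v."""
    return (u.conjugate() * v).imag

def intersect(p1, p2, q1, q2):
    """Intersection of the lines p1p2 and q1q2 (assumed not parallel)."""
    d1, d2 = p2 - p1, q2 - q1
    return p1 + cross(q1 - p1, d2) / cross(d1, d2) * d1

A, B, C = 0j, 4 + 0j, 1 + 3j            # counterclockwise, no angle above 120 degrees
A1, B1, C1 = outward_apex(B, C), outward_apex(C, A), outward_apex(A, B)

P = intersect(A, A1, C, C1)
total = abs(P - A) + abs(P - B) + abs(P - C)
angle_bpc = abs(cmath.phase((B - P) / (C - P))) * 180 / pi

print(total, abs(A - A1), abs(B - B1), abs(C - C1))
print(angle_bpc)
```

For this triangle the three segments have the same length as the sum AP + BP + CP, and the side BC subtends 120° at P, up to rounding error.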


Figure 1.8c 


EXERCISES 


1. In Figure 1.8a, UV and VW make equal angles with CA. Deduce that the 
orthocenter of any triangle is the incenter of its orthic triangle. (In other words, if ABC is a 
triangular billiard table, a ball at U, hit in the direction UV, will go round the triangle 
UVW indefinitely, that is, until it is stopped by friction.) 

2. How does Fagnano’s problem collapse when we try to apply it to a triangle ABC 
in which the angle A is obtuse? 

3. The circumcircles of the three equilateral triangles in Figure 1.8c all pass through 
P, and their centers form a fourth equilateral triangle.* 

4. Three holes, at the vertices of an arbitrary triangle, are drilled through the top of 
a table. Through each hole a thread is passed with a weight hanging from it below the 
table. Above, the three threads are all tied together and then released. If the three 
weights are all equal, where will the knot come to rest? 

5. Four villages are situated at the vertices of a square of side one mile. The in-
habitants wish to connect the villages with a system of roads, but they have only enough 
material to make √3 + 1 miles of road. How do they proceed? [Courant and Rob-
bins 1, p. 392.] 

6. Solve Fermat’s problem for a triangle ABC with A > 120°, and for a convex 
quadrangle ABCD. 

7. If two points P, P’, inside a triangle ABC, are so situated that ∠CBP = ∠PBP’ 
= ∠P’BA, ∠ACP’ = ∠P’CP = ∠PCB, then ∠BP’P = ∠PP’C. 

8. If four squares are placed externally (or internally) on the four sides of any 
parallelogram, their centers are the vertices of another square. [Yaglom 1, pp. 96-97.] 

9. Let X, Y, Z be the centers of squares placed externally on the sides BC, CA, AB 
of a triangle ABC. Then the segment AX is congruent and perpendicular to YZ (also 
BY to ZX and CZ to XY). (W. A. J. Luxemburg.) 

10. Let Z, X, U, V be the centers of squares placed externally on the sides AB, BC, 
CD, DA of any simple quadrangle (or “quadrilateral”) ABCD. Then the segment ZU 
(joining the centers of two “opposite” squares) is congruent and perpendicular to XV. 
[Forder 2, p. 40.] 


* Court [1, pp. 105-107]. See also Mathesis 1938, p. 293 (footnote, where this theorem is attrib-
uted to Napoleon); and Forder [2, p. 40] for some interesting generalizations. 


1.9 MORLEY’S THEOREM 


Many of the proofs in mathematics are very long and intricate. Others, 
though not long, are very ingeniously constructed. 
E. C. Titchmarsh (1899-1963) 
[Titchmarsh 1, p. 23] 


One of the most surprising theorems in elementary geometry was discov-
ered about 1899 by F. Morley (whose son Christopher wrote novels such as 
Thunder on the Left). He mentioned it to his friends, who spread it over 
the world in the form of mathematical gossip. At last, after ten years, a 
trigonometrical proof by M. Satyanarayana and an elementary proof by 
M. T. Naraniengar were published.* 


Figure 1.9a 


MORLEY’S THEOREM. The three points of intersection of the adjacent tri- 
sectors of the angles of any triangle form an equilateral triangle. 


In other words, any triangle ABC yields an equilateral triangle PQR if 
the angles A, B, C are trisected by AQ and AR, BR and BP, CP and CQ, 
as in Figure 1.9a. (Much trouble is experienced if we try a direct approach, 
but the difficulties disappear if we work backwards, beginning with an equi-
lateral triangle and building up a general triangle which is afterwards 
identified with the given triangle ABC.) 

On the respective sides QR, RP, PQ of a given equilateral triangle PQR, 
erect isosceles triangles P’QR, Q’RP, R’PQ whose base angles α, β, γ satisfy 
the equation and inequalities 


$$\alpha + \beta + \gamma = 120^\circ, \quad \alpha < 60^\circ, \quad \beta < 60^\circ, \quad \gamma < 60^\circ.$$


* Mathematical Questions and their Solutions from the Educational Times (New Series), 15 
(1909), pp. 23-24, 47. See also C. H. Chepmell and R. F. Davis, Mathematical Gazette, 11 (1923), 
pp. 85-86; F. Morley, American Journal of Mathematics, 51 (1929), pp. 465-472; H. D. Grossman, 
American Mathematical Monthly, 50 (1943), p. 552; and L. Bankoff, Mathematics Magazine, 
35 (1962), pp. 223-224. The treatment given here is due to Raoul Bricard, Nouvelles Annales 
de Mathématiques (5), 1 (1922), pp. 254-258. A similar proof was devised independently 
by Bottema [1, p. 34]. 

Extend the sides of the isosceles triangles below their bases until they meet 
again in points A, B, C. Since α + β + γ + 60° = 180°, we can immediately 
infer the measurement of some other angles, as marked in Figure 
1.9a. For instance, the triangle AQR must have an angle 60° − α at its 
vertex A, since its angles at Q and R are α + β and γ + α. 

Referring to 1.51, we see that one way to characterize the incenter I of a 
triangle ABC is to describe it as lying on the bisector of the angle A at such 
a distance that 


$$\angle BIC = 90^\circ + \tfrac12 A.$$
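
The value of this angle can be confirmed in one line. The sketch below assumes only that BI and CI bisect the angles B and C, and that the angles of the triangles IBC and ABC each add up to 180°:

```latex
\angle BIC = 180^\circ - \tfrac12 B - \tfrac12 C
           = 180^\circ - \tfrac12\,(180^\circ - A)
           = 90^\circ + \tfrac12 A .
```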


Applying this principle to the point P in the triangle P’BC, we observe that 
the line PP’ (which is a median of both the equilateral triangle PQR and 
the isosceles triangle P’QR) bisects the angle at P’. Also the half angle at 
P’ is 90° − α, and 


$$\angle BPC = 180^\circ - \alpha = 90^\circ + (90^\circ - \alpha).$$


Hence P is the incenter of the triangle P’BC. Likewise Q is the incenter of 
Q’CA, and R of R’AB. Therefore all the three small angles at C are equal; 
likewise at A and at B. In other words, the angles of the triangle ABC are 
trisected. 

The three small angles at A are each ⅓A = 60° − α; similarly at B and 
C. Thus 


$$\alpha = 60^\circ - \tfrac13 A, \quad \beta = 60^\circ - \tfrac13 B, \quad \gamma = 60^\circ - \tfrac13 C.$$


By choosing these values for the base angles of our isosceles triangles, we 
can ensure that the above procedure yields a triangle ABC that is similar 
to any given triangle. 
This completes the proof. 
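
Morley’s theorem lends itself to a numerical check in the direct order that the proof avoids. The following sketch is an illustration on an assumed triangle, not a substitute for the argument above. It represents points by complex numbers, assumes the triangle is lettered counterclockwise, and intersects the adjacent trisectors to obtain P, Q, R. The function names are hypothetical.

```python
import cmath

def cross(u, v):
    return (u.conjugate() * v).imag

def intersect(p1, p2, q1, q2):
    """Intersection of the lines p1p2 and q1q2 (assumed not parallel)."""
    d1, d2 = p2 - p1, q2 - q1
    return p1 + cross(q1 - p1, d2) / cross(d1, d2) * d1

def angle_at(v, p, q):
    """Angle at v from the ray vp round to the ray vq, counterclockwise."""
    return cmath.phase((q - v) / (p - v))

def rotate(z, theta):
    return z * cmath.exp(1j * theta)

def morley_triangle(a, b, c):
    """Vertices P, Q, R for a triangle abc lettered counterclockwise."""
    ang_a, ang_b, ang_c = angle_at(a, b, c), angle_at(b, c, a), angle_at(c, a, b)

    def adjacent(u, v, ang_u, ang_v):
        # Trisector at u nearest to side uv meets trisector at v nearest to vu.
        return intersect(u, u + rotate(v - u, ang_u / 3),
                         v, v + rotate(u - v, -ang_v / 3))

    return (adjacent(b, c, ang_b, ang_c),    # P, from BP and CP
            adjacent(c, a, ang_c, ang_a),    # Q, from CQ and AQ
            adjacent(a, b, ang_a, ang_b))    # R, from AR and BR

P, Q, R = morley_triangle(0j, 4 + 0j, 1 + 3j)   # assumed example triangle
print(abs(P - Q), abs(Q - R), abs(R - P))
```

The three printed lengths agree to within rounding error.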


EXERCISES 


1. The three lines PP’, QQ’, RR’ (Figure 1.9a) are concurrent. In other words, 
the trisectors of A, B, C meet again to form another triangle P’Q’R’ which is perspec-
tive with the equilateral triangle PQR. (In general P’Q’R’ is not equilateral.) 

2. What values of α, β, γ will make the triangle ABC (i) equilateral, (ii) right-
angled isosceles? Sketch the figure in each case. 

3. Let P₁ and P₂ (on CA and AB) be the images of P by reflection in CP’ and BP’. 
Then the four points P₁, Q, R, P₂ are evenly spaced along a circle through A. In the 
special case when the triangle ABC is equilateral, these four points occur among the 
vertices of a regular enneagon (9-gon) in which A is the vertex opposite to the side QR. 


Regular Polygons 


We begin this chapter by discussing (without proofs) the possibility of 
constructing certain regular polygons with the instruments allowed by 
Euclid. We then consider all these polygons, regardless of the question of 
constructibility, from the standpoint of symmetry. Finally, we extend the 
concept of a regular polygon so as to include star polygons. 


2.1 CYCLOTOMY 


One, two! One, two! And through and through 
The vorpal blade went snicker-snack! 

Lewis Carroll 
[Dodgson 2, Chap. 1] 


Euclid’s postulates imply a restriction on the instruments that he allowed 
for making constructions, namely the restriction to ruler (or straightedge) 
and compasses. He constructed an equilateral triangle (I.1), a square (IV.6), 
a regular pentagon (IV.11), a regular hexagon (IV.15), and a regular 15-gon 
(IV.16). The number of sides may be doubled again and again by repeated 
angle bisections. It is natural to ask which other regular polygons can be 
constructed with Euclid’s instruments. This question was completely an- 
swered by Gauss (1777-1855) at the age of nineteen [see Smith 2, pp. 301- 
302]. Gauss found that a regular n-gon, say {n}, can be so constructed if 
the odd prime factors of n are distinct “Fermat primes” 


$$F_k = 2^{2^k} + 1.$$

The only known primes of this kind are 

$$F_0 = 2^1 + 1 = 3, \quad F_1 = 2^2 + 1 = 5, \quad F_2 = 2^4 + 1 = 17,$$
$$F_3 = 2^8 + 1 = 257, \quad F_4 = 2^{16} + 1 = 65537.$$

To inscribe a regular pentagon in a given circle, simpler constructions 
than Euclid’s were given by Ptolemy and Richmond.* The former has been 
repeated in many textbooks. The latter is as follows (Figure 2.1a). 

To inscribe a regular pentagon P₀P₁P₂P₃P₄ in a circle with center O: draw 
the radius OB perpendicular to OP₀; join P₀ to D, the midpoint of OB; 
bisect the angle ODP₀ to obtain N₁ on OP₀; and draw N₁P₁ perpendicular 
to OP₀ to obtain P₁ on the circle. Then P₀P₁ is a side of the desired 
pentagon. 




Figure 2.1a Figure 2.1b 


Richmond also gave a simple construction for the {17} P₀P₁ … P₁₆ 
(Figure 2.1b). Join P₀ to J, one quarter of the way from O to B. On the 
diameter through P₀ take E, F, so that ∠OJE is one quarter of OJP₀ and 
∠FJE is 45°. Let the circle on FP₀ as diameter cut OB in K, and let the 
circle with center E and radius EK cut OP₀ in N₃ (between O and P₀) and 
N₅. Draw perpendiculars to OP₀ at these two points, to cut the original 
circle in P₃ and P₅. Then the arc P₃P₅ (and likewise P₁P₃) is 2/17 of the cir-
cumference. (The proof involves repeated application of the principle that 
the roots of the equation x² + 2x cot 2C − 1 = 0 are tan C and −cot C.) 

Richelot and Schwendenwein constructed the regular 257-gon in 1832. 
J. Hermes spent ten years on the regular 65537-gon and deposited the 
manuscript in a large box in the University of Göttingen, where it may still be 
found. 

The next number of the form $F_k = 2^{2^k} + 1$ is $F_5 = 4294967297$. Fer-
mat incorrectly assumed it to be prime. G. T. Bennett gave the following 
neat proof† that it is composite [Hardy and Wright 1, p. 14]: the number 


$$641 = 5^4 + 2^4 = 5 \cdot 2^7 + 1,$$

dividing both $5^4 \cdot 2^{28} + 2^{32}$ and $5^4 \cdot 2^{28} - 1$, divides their difference, which is $F_5$. 
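
Bennett’s argument involves nothing beyond integer arithmetic, so each step can be confirmed directly. The variable names in this short sketch are chosen for the illustration only.

```python
F5 = 2 ** (2 ** 5) + 1

first = 5 ** 4 * 2 ** 28 + 2 ** 32      # equals 2**28 * (5**4 + 2**4)
second = 5 ** 4 * 2 ** 28 - 1           # equals (5 * 2**7)**4 - 1

print(5 ** 4 + 2 ** 4, 5 * 2 ** 7 + 1)  # the two expressions for 641
print(first % 641, second % 641)        # both remainders
print(first - second == F5)             # the difference is F5
print(F5 % 641, F5 // 641)              # remainder and cofactor
```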


* H. W. Richmond, Quarterly Journal of Mathematics, 26 (1893), pp. 296-297; see also H. E. 
Dudeney, Amusements in Mathematics (London 1917), p. 38. 
† Rediscovered by P. Kanagasabapathy, Mathematical Gazette, 42 (1958), p. 310. 

The question naturally arises whether $F_k$ may be prime for some greater 
value of k. It is now known that this can happen only if $F_k$ divides 
$3^{(F_k - 1)/2} + 1$. Using this criterion, electronic computing machines have shown that 
$F_k$ is composite for 5 ≤ k ≤ 16. Therefore Hermes’s construction is the last 
of its kind that will ever be undertaken! 
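
The criterion is well suited to machine computation, because the power can be reduced modulo $F_k$ at every step. The following sketch applies it to the first few Fermat numbers. The function names are hypothetical, and the range of k is kept small merely to keep the example short.

```python
def fermat_number(k):
    return 2 ** (2 ** k) + 1

def satisfies_criterion(k):
    """True when F_k divides 3**((F_k - 1) / 2) + 1 (meaningful for k >= 1)."""
    f = fermat_number(k)
    # pow with three arguments performs modular exponentiation.
    return pow(3, (f - 1) // 2, f) == f - 1

for k in range(1, 9):
    print(k, satisfies_criterion(k))
```

The primes $F_1, \ldots, F_4$ satisfy the criterion, whereas $F_5, \ldots, F_8$ fail it and are therefore composite.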


EXERCISES 


1. Verify the correctness of Richmond’s construction for {5} (Figure 2.1a). 


2. Assuming Richmond’s construction for {17}, how would you inscribe {51} in 
the same circle? 


2.2 ANGLE TRISECTION 


To trisect a given angle, we may proceed to find the sine of the angle—
say a—then, if x is the sine of an angle equal to one-third of the given 
angle, we have 4x³ = 3x − a. 


W. W. Rouse Ball (1850 -1925) 
[Ball 1, p. 327] 


Gauss was almost certainly aware of the fact that his cyclotomic condition 
is necessary as well as sufficient, but he does not seem to have said so ex- 
plicitly. The missing step was supplied by Wantzel*, who proved that, if 
the odd prime factors of n are not distinct Fermat primes, {n} cannot be 
constructed with ruler and compasses. For instance, since 7 is not a Fermat 
prime, Euclid’s instruments will not suffice for the regular heptagon {7}; and 
since the factors of 9 are not distinct, the same is true for the enneagon {9}. 
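
Gauss’s condition and Wantzel’s converse together give a simple test for constructibility. The sketch below rests on an assumption stated in § 2.1 only as a matter of present knowledge: it treats 3, 5, 17, 257, 65537 as the complete list of Fermat primes, so it would misclassify any n divisible by a Fermat prime not yet discovered. The function name is hypothetical.

```python
KNOWN_FERMAT_PRIMES = (3, 5, 17, 257, 65537)   # assumption: no others are known

def is_constructible(n):
    """True if {n} is constructible: n is a power of 2 times distinct Fermat primes."""
    if n < 3:
        raise ValueError("a polygon needs at least 3 sides")
    while n % 2 == 0:                # the number of sides may be doubled repeatedly
        n //= 2
    for p in KNOWN_FERMAT_PRIMES:
        if n % p == 0:
            n //= p                  # each Fermat prime may occur only once
    return n == 1

print([n for n in range(3, 21) if is_constructible(n)])
```

Among the polygons with at most 20 sides, the test excludes {7}, {9}, {11}, {13}, {14}, {18}, {19}.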

The problem of trisecting an arbitrary angle with ruler and compasses 
exercised the ingenuity of professional and amateur mathematicians for two 
thousand years [Ball 1, pp. 333-335]. It is, of course, easy to trisect cer- 
tain particular angles, such as a right angle. But any construction for tri- 
secting an arbitrary angle could be applied to an angle of 60°, and then we 
could draw a regular enneagon. In view of Wantzel’s theorem, we may say 
that it has been known since 1837 that the classical trisection problem can 
never be solved. 

This is probably the reason why Morley’s Theorem (§1.9) was not dis- 
covered till the twentieth century: people felt uneasy about mentioning the 
trisectors of an angle. However, although the trisectors cannot be con- 
structed by means of the ruler and compasses, they can be found in other 
ways [Cundy and Rollett 1, pp. 208-211]. Even if these more versatile in- 
struments had never been discovered, the theorem would still be meaning- 
ful. Most mathematicians are willing to accept the existence of things that 
they have not been able to construct. For instance, it was proved in 1909 
that the Fermat numbers F₇ and F₈ are composite, but their smallest prime 
factors still remain to be computed. 


EXERCISE 


The number $2^n + 1$ is composite whenever n is not a power of 2. 

* P. L. Wantzel, Journal de Mathématiques pures et appliquées, 2 (1837), pp. 366-372. 


2.3 ISOMETRY 


One way of describing the structure of space, preferred by both New-
ton and Helmholtz, is through the notion of congruence. Congruent 
parts of space V, V’ are such as can be occupied by the same rigid 
body in two of its positions. If you move the body from one into the 
other position the particle of the body covering a point P of V will after-
wards cover a certain point P’ of V’, and thus the result of the motion 
is a mapping P → P’ of V upon V’. We can extend the rigid body 
either actually or in imagination so as to cover an arbitrarily given 
point P of space, and hence the congruent mapping P → P’ can be 
extended to the entire space. 


Hermann Weyl (1885-1955) 
[Weyl 1, p. 43] 


We shall find it convenient to use the word transformation in the special 
sense of a one-to-one correspondence P → P’ among all the points in 
the plane (or in space), that is, a rule for associating pairs of points, 
with the understanding that each pair has a first member P and a second 
member P’ and that every point occurs as the first member of just one pair 
and also as the second member of just one pair. It may happen that the 
members of a pair coincide, that is, that P’ coincides with P; in this case P 
is called an invariant point (or “double point”) of the transformation. 

In particular, an isometry (or “congruent transformation,” or 
“congruence”) is a transformation which preserves length, so that, if (P, P’) and 
(Q, Q’) are two pairs of corresponding points, we have PQ = P’Q’: PQ 
and P’Q’ are congruent segments. For instance, a rotation of the plane about 
P (or about a line through P perpendicular to the plane) is an isometry hav-
ing P as an invariant point, but a translation (or “parallel displacement”) 
has no invariant point: every point is moved. 

A reflection is the special kind of isometry in which the invariant points 
consist of all the points on a line (or plane) called the mirror. 

A still simpler kind of transformation (so simple that it may at first seem 
too trivial to be worth mentioning) is the identity, which leaves every point 
unchanged. The result of applying several transformations successively is 
called their product. If the product of two transformations is the identity, 
each is called the inverse of the other, and their product in the reverse order 
is again the identity. 

2.31 If an isometry has more than one invariant point, it must be either the 
identity or a reflection. 


To prove this, let A and B be two invariant points, and P any point not on 
the line AB (Figure 1.3b). The corresponding point P’, satisfying 


AP’ = AP, BP’ = BP, 


must lie on the circle with center A and radius AP, and on the circle with 
center B and radius BP. Since P is not on AB, these circles do not touch each 
other but intersect in two points, one of which is P. Hence P’ is either P 
itself or the image of P by reflection in AB. 


2.4 SYMMETRY 


Tyger! Tyger! burning bright 
In the forests of the night, 
What immortal hand or eye 
Dare frame thy fearful symmetry? 

William Blake (1757-1827) 


When we say that a figure is “symmetrical” we mean that we can apply 
certain isometries, called symmetry operations, which leave the whole figure 
unchanged while permuting its parts. For example, the capital letters E and 
A (Figure 2.4a) have bilateral symmetry, the mirror being horizontal for the 
former, vertical for the latter. The letter N (Figure 2.4b) is symmetrical by a 
half-turn, or rotation through 180° (or “reflection in a point,” or “central in- 
version”), which may be regarded as the result of reflecting horizontally and 
then vertically, or vice versa. The swastika (Figure 2.4c) is symmetrical by 
rotation through any number of right angles. 


Figure 2.4a Figure 2.4b 


In counting the symmetry operations of a figure, it is usual to include the 
identity; any figure has this trivial symmetry. Thus the swastika admits four 
distinct symmetry operations: rotations through 1, 2, 3, or 4 right angles. 
The last is the identity. The first and third are inverses of each other, since 
their product is the identity. 

This use of the word “product” suggests an algebraic symbolism in which 
the transformations are denoted by capital letters while 1 denotes the 
identity. (Instead of 1, some authors write E.) Thus if S is the counterclockwise 
quarter-turn, the four symmetry operations of the swastika are 


$$S, \quad S^2, \quad S^3 = S^{-1} \quad \text{and} \quad S^4 = 1.$$


Since the smallest power of S that is equal to the identity is the fourth power, 
we say that S is of period 4. Similarly S², being a half-turn, is of period 2 [see 
Coxeter 1, p. 39]. The only transformation of period 1 is the identity. A 
translation is aperiodic (that is, it has no period), but it is conveniently said 
to be of infinite period. 
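
Periods can be computed mechanically once the transformations are written as matrices. The sketch below makes assumptions that are not part of the text: it uses 3 × 3 matrices acting on homogeneous coordinates (x, y, 1), so that a translation can be represented alongside the rotations, and it gives up after a fixed number of steps. The names are hypothetical.

```python
def matmul(p, q):
    """Product of two square matrices given as tuples of tuples."""
    n = len(p)
    return tuple(tuple(sum(p[i][k] * q[k][j] for k in range(n)) for j in range(n))
                 for i in range(n))

IDENTITY = ((1, 0, 0), (0, 1, 0), (0, 0, 1))
S = ((0, -1, 0), (1, 0, 0), (0, 0, 1))   # counterclockwise quarter-turn about the origin
T = ((1, 0, 1), (0, 1, 0), (0, 0, 1))    # translation by one unit along the x-axis

def period(t, limit=1000):
    """Smallest n with t**n equal to the identity, or None if none is found up to limit."""
    power = t
    for n in range(1, limit + 1):
        if power == IDENTITY:
            return n
        power = matmul(power, t)
    return None

print(period(S), period(matmul(S, S)), period(IDENTITY), period(T))
```

The value None returned for T reflects only the search limit. That a translation has no finite period follows from the remark above, not from the computation.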

Some figures admit both reflections and rotations as symmetry operations. 
The letter H (Figure 2.4d) has a horizontal mirror (like E) and a vertical 
mirror (like A), as well as a center of rotational symmetry (like N) where the 
two mirrors intersect. Thus it has four symmetry operations: the identity 1, 
the horizontal reflection R₁, the vertical reflection R₂, and the half-turn 
R₁R₂ = R₂R₁. 


Figure 2.4c Figure 2.4d 


EXERCISES 


1. Every isometry of period 2 is either a reflection or a half-turn [Bachmann 1, pp. 
2-3]. 

2. Express (a) a half-turn, (b) a quarter-turn, as transformations of (i) Cartesian co-
ordinates, (ii) polar coordinates. (Take the origin to be the center of rotation.) 


2.5 GROUPS 


Symmetry, as wide or as narrow as you may define its meaning, is one 
idea by which man through the ages has tried to comprehend and 
create order, beauty, and perfection. 


Hermann Weyl [1, p. 5] 


A set of transformations [Birkhoff and MacLane 1, pp. 115-118] is said 
to form a group if it contains the inverse of each and the product of any two 
(including the product of one with itself or with its inverse). The number of 
distinct transformations is called the order of the group. (This may be either 
finite or infinite.) Clearly the symmetry operations of any figure form a 
group. This is called the symmetry group of the figure. In the extreme case 
when the figure is completely irregular (like the numeral 6) its symmetry 
group is of order one, consisting of the identity alone.

The symmetry group of the letter E or A (Figure 2.4a) is the so-called
dihedral group of order 2, generated by a single reflection and denoted by $D_1$.
(The name is easily remembered, as the Greek origin of the word “dihedral”
is almost equivalent to the Latin origin of “bilateral.”) The symmetry group
of the letter N (Figure 2.4b) is likewise of order 2, but in this case the
generator is a half-turn and we speak of the cyclic group, $C_2$. The two groups
$D_1$ and $C_2$ are abstractly identical or isomorphic; they are different
geometrical representations of the single abstract group of order 2, defined by
the relation


2.51 $R^2 = 1$


or $R = R^{-1}$ [Coxeter and Moser 1, p. 1].

The symmetry group of the swastika is $C_4$, the cyclic group of order 4,
generated by the quarter-turn S and abstractly defined by the relation $S^4 = 1$.
That of the letter H (Figure 2.4d) is $D_2$, the dihedral group of order 4,
generated by the two reflections $R_1$, $R_2$ and abstractly defined by the relations


2.52 $R_1^2 = 1$, $R_2^2 = 1$, $R_1R_2 = R_2R_1$.


Although $C_4$ and $D_2$ both have order 4, they are not isomorphic: they have
a different structure, different “multiplication tables.” To see this, it suffices
to observe that $C_4$ contains two operations of period 4, whereas all the
operations in $D_2$ (except the identity) are of period 2: the generators obviously,
and their product also, since


$(R_1R_2)^2 = R_1R_2R_1R_2 = R_1R_2R_2R_1 = R_1R_2^2R_1 = R_1R_1 = R_1^2 = 1$.


This last remark illustrates what we mean by saying that 2.52 is an
abstract definition for $D_2$, namely that every true relation concerning the
generators $R_1$, $R_2$ is an algebraic consequence of these simple relations. An
alternative abstract definition for the same group is


2.53 $R_1^2 = 1$, $R_2^2 = 1$, $(R_1R_2)^2 = 1$,

from which we can easily deduce $R_1R_2 = R_2R_1$.
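
The contrast between $C_4$ and $D_2$ drawn above can also be checked mechanically. The following Python sketch is only an illustration, not part of the original argument: it assumes that S is the quarter-turn about the origin and that $R_1$, $R_2$ are the reflections in the coordinate axes, all written as 2 × 2 integer matrices; the helper names `mul` and `period` are hypothetical.

```python
# 2x2 integer matrices stored as tuples of rows, so all arithmetic is exact.
I = ((1, 0), (0, 1))
S = ((0, -1), (1, 0))     # counterclockwise quarter-turn about the origin
R1 = ((1, 0), (0, -1))    # reflection in the horizontal mirror
R2 = ((-1, 0), (0, 1))    # reflection in the vertical mirror


def mul(a, b):
    """Matrix product of two 2x2 matrices."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def period(m):
    """Smallest n >= 1 for which the n-th power of m is the identity."""
    n, power = 1, m
    while power != I:
        power = mul(power, m)
        n += 1
    return n


C4 = [I, S, mul(S, S), mul(mul(S, S), S)]
D2 = [I, R1, R2, mul(R1, R2)]

periods_C4 = sorted(period(m) for m in C4)
periods_D2 = sorted(period(m) for m in D2)
print(periods_C4)  # [1, 2, 4, 4]
print(periods_D2)  # [1, 2, 2, 2]
```

The two lists of periods differ, so no one-to-one correspondence between the groups can preserve products.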

The general cyclic group $C_n$, of order n, has the abstract definition

2.54 $S^n = 1$.


Its single generator S, of period n, is conveniently represented by a rotation
through 360°/n. Then $S^k$ is a rotation through k times this angle, and the n
operations in $C_n$ are given by the values of k from 1 to n, or from 0 to n − 1.
In particular, $C_5$ occurs in nature as the symmetry group of the periwinkle
flower.


EXERCISE 


Express a rotation through angle α about the origin as a transformation of (i) polar
coordinates, (ii) Cartesian coordinates. If f(r, θ) = 0 is the equation for a curve in polar
coordinates, what is the equation for the transformed curve?


2.6 THE PRODUCT OF TWO REFLECTIONS


Thou in thy lake dost see
Thyself.


J. M. Legaré (1823-1859)
(To a Lily)


In any group of transformations, the associative law

(RS)T = R(ST)

is automatically satisfied, but the commutative law

RS = SR


does not necessarily hold, and care must be taken in inverting a product, for
example,


$(RS)^{-1} = S^{-1}R^{-1}$,


not $R^{-1}S^{-1}$. (This becomes clear when we think of R and S as the operations
of putting on our socks and shoes, respectively.)


Figure 2.6a


The product of reflections in two intersecting lines (or planes) is a
rotation through twice the angle between them. In fact, if A, B, C, D, ... are
evenly spaced on a circle with center O, let $R_1$ and $R_2$ be the reflections in
OB and OC (Figure 2.6a). Then $R_1$ reflects the triangle OAB into OCB,
which is reflected by $R_2$ to OCD; thus $R_1R_2$ is the rotation through ∠AOC
or ∠BOD, which is twice ∠BOC. Since a rotation is completely determined
by its center and its angle, $R_1R_2$ is equal to the product of reflections in any
two lines through O making the same angle as OB and OC. (The reflections
in OA and OB are actually $R_1R_2R_1$ and $R_1$, whose product is
$R_1R_2R_1^2 = R_1R_2$.) In particular, the half-turn about O is the product of
reflections in any two perpendicular lines through O.
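
The double-angle rule can be tested numerically. The sketch below is an illustration under stated assumptions: the mirrors pass through the origin at angles `phi_B` and `phi_C` (arbitrary sample values), the product "first, then second" is read from left to right as in the text, and the helper names `reflection`, `rotation`, `then` and `close` are hypothetical.

```python
import math


def reflection(phi):
    """Matrix of the reflection in the line through the origin at angle phi."""
    c, s = math.cos(2 * phi), math.sin(2 * phi)
    return ((c, s), (s, -c))


def rotation(theta):
    """Matrix of the counterclockwise rotation through theta about the origin."""
    c, s = math.cos(theta), math.sin(theta)
    return ((c, -s), (s, c))


def then(first, second):
    """Product 'first, then second'; as matrices this is second times first."""
    return tuple(
        tuple(sum(second[i][k] * first[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def close(a, b, tol=1e-12):
    """True when two 2x2 matrices agree entry by entry within tol."""
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))


phi_B, phi_C = math.radians(20), math.radians(55)   # sample mirror directions
R1, R2 = reflection(phi_B), reflection(phi_C)

# R1 then R2 is the rotation through twice the angle from the first mirror to the second.
is_double_angle = close(then(R1, R2), rotation(2 * (phi_C - phi_B)))
# Taking the mirrors in the other order gives the inverse rotation.
reverse_is_inverse = close(then(R2, R1), rotation(-2 * (phi_C - phi_B)))
print(is_double_angle, reverse_is_inverse)
```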

Since $R_1R_2$ is a counterclockwise rotation, $R_2R_1$ is the corresponding
clockwise rotation; in fact,


$R_2R_1 = R_2^{-1}R_1^{-1} = (R_1R_2)^{-1}$.


This is the same as $R_1R_2$ if the two mirrors are at right angles, in which case
$R_1R_2$ is a half-turn and $(R_1R_2)^2 = 1$.


EXERCISES 


1. The product of quarter-turns (in the same sense) about C and B is the half-turn 
about the center of a square having BC for a side. 

2. Let ACPQ and BARS be squares on the sides AC and BA of a triangle ABC. If B
and C remain fixed while A varies freely, PS passes through a fixed point.


2.7 THE KALEIDOSCOPE 


$D_2$ is a special case of the general dihedral group $D_n$, which is, for n > 2,
the symmetry group of the regular n-gon, {n}. (See Figure 2.7a for the cases
n = 3, 4, 5.) This is evidently a group of order 2n, consisting of n rotations
(through the n effectively distinct multiples of 360°/n) and n reflections.
When n is odd, each of the n mirrors joins a vertex to the midpoint of the
opposite side; when n is even, ½n mirrors join pairs of opposite vertices and ½n
bisect pairs of opposite sides [see Birkhoff and MacLane 1, pp. 117-118, 135].


Figure 2.7a


The n rotations are just the operations of the cyclic group $C_n$. Thus the
operations of $D_n$ include all the operations of $C_n$: in technical language, $C_n$
is a subgroup of $D_n$. The rotation through 360°/n, which generates the
subgroup, may be described as the product $S = R_1R_2$ of reflections in two
adjacent mirrors (such as OB and OC in Figure 2.7a) which are inclined at
180°/n.

Let $R_1, R_2, \ldots, R_n$ denote the n reflections in their natural order of
arrangement. Then $R_1R_{k+1}$, being the product of reflections in two mirrors
inclined at k times 180°/n, is a rotation through k times 360°/n:


$R_1R_{k+1} = S^k$.

Thus $R_{k+1} = R_1S^k$, and the n reflections may be expressed as

$R_1, R_1S, R_1S^2, \ldots, R_1S^{n-1}$.

In other words, $D_n$ is generated by $R_1$ and S. By substituting $R_1R_2$ for S, we
see that the same group is equally well generated by $R_1$ and $R_2$, which satisfy
the relations


2.71 $R_1^2 = 1$, $R_2^2 = 1$, $(R_1R_2)^n = 1$.


(The first two relations come from 2.51 and the third from 2.54.) These re- 
lations can be shown to suffice for an abstract definition [see Coxeter and 
Moser 1, pp. 6, 36]. 
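
As a small experiment, the group generated by two adjacent reflections can be built by brute force and its order compared with 2n. The sketch below is only an illustration: it assumes the vertices of {n} are labeled 0, 1, ..., n − 1 in order, so that each symmetry operation becomes a permutation of the labels; the function names `generate` and `dihedral` are hypothetical.

```python
def generate(generators):
    """Close a list of permutations (tuples) under 'g, then h' products."""
    identity = tuple(range(len(generators[0])))
    group, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for h in generators:
            # 'g, then h': vertex i goes first to g[i], then to h[g[i]].
            product = tuple(h[g[i]] for i in range(len(g)))
            if product not in group:
                group.add(product)
                frontier.append(product)
    return group


def dihedral(n):
    """Group generated by reflections in two adjacent mirrors of the n-gon."""
    r1 = tuple((-i) % n for i in range(n))     # mirror through vertex 0
    r2 = tuple((1 - i) % n for i in range(n))  # mirror bisecting the side 0-1
    return generate([r1, r2])


orders = {n: len(dihedral(n)) for n in range(3, 8)}
print(orders)  # {3: 6, 4: 8, 5: 10, 6: 12, 7: 14}
```

Here "r1, then r2" sends vertex i to i + 1, which is the rotation S through 360°/n, in agreement with $S = R_1R_2$.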

A practical way to make a model of $D_n$ is to join two ordinary mirrors by a
hinge and stand them on the lines OB, OC of Figure 2.7a so that they are
inclined at 180°/n. Any object placed between the mirrors yields 2n visible
images (including the object itself). If the object is your right hand, n of the 
images will look like a left hand, illustrating the principle that, since a reflec- 
tion reverses sense, the product of any even number of reflections preserves 
sense, and the product of any odd number of reflections reverses sense. 

The first published account of this instrument seems to have been by 
Athanasius Kircher in 1646. The name kaleidoscope (from καλός, beautiful;
εἶδος, a form; and σκοπεῖν, to see) was coined by Sir David Brewster, who
wrote a treatise on its theory and history. He complained [Brewster 1, p. 
147] that Kircher allowed the angle between the two mirrors to be any sub- 
multiple of 360° instead of restricting it to submultiples of 180°. 

The case when n = 2 is, of course, familiar. Standing between two per- 
pendicular mirrors (as at a corner of a room), you see your image in each 
and also the image of the image, which is the way other people see you. 

Having decided to use the symbol $D_n$ for the dihedral group generated
by reflections in two planes making a “dihedral” angle of 180°/n, we
naturally stretch the notation so as to allow the extreme value n = 1. Thus
$D_1$ is the group of order 2 generated by a single reflection, that is, the
symmetry group of the letter E or A, whereas the isomorphic group $C_2$, generated
by a half-turn, is the symmetry group of the letter N.

According to Weyl [1, pp. 66, 99], it was Leonardo da Vinci who discovered that the only finite
groups of isometries in the plane are


$C_1, C_2, C_3, \ldots,$
$D_1, D_2, D_3, \ldots.$


His interest in them was from the standpoint of architectural plans. Of course, the prevalent
groups in architecture have always been $D_1$ and $D_2$. But the pyramids of Egypt exhibit the group
$D_4$, and Leonardo’s suggestion has been followed to some extent in modern times: the Pentagon
Building in Washington has the symmetry group $D_5$, and the Bahai Temple near Chicago has $D_9$.
In nature, many flowers have dihedral symmetry groups such as $D_6$. The symmetry group of a
snowflake is usually $D_6$ but occasionally only $D_3$. [Kepler 1, pp. 259-280.]

If you cut an apple the way most people cut an orange, the core is seen to have the symmetry
group $D_5$. Extending the five-pointed star by straight cuts in each half, you divide the whole
apple into ten pieces from each of which the core can be removed in the form of two thin flakes.


EXERCISES 


1. Describe the symmetry groups of 


(a) a scalene triangle, (b) an isosceles triangle,
(c) a parabola, (d) a parallelogram,
(e) a rhombus, (f) a rectangle,
(g) an ellipse.


2. Use inverses and the associative law to prove algebraically the “cancellation rule”’ 
which says that the relation 


RT = ST 


implies R = S.

3. Show how the usual defining relations for $D_3$, namely 2.71 with n = 3, may be
deduced by algebraic manipulation from the simpler relations


$R_1^2 = 1$, $R_1R_2R_1 = R_2R_1R_2$.


4. The cyclic group $C_m$ is a subgroup of $C_n$ if and only if the number m is a divisor of
n. In particular, if n is prime, the only subgroups of $C_n$ are $C_n$ itself and $C_1$.


2.8 STAR POLYGONS 


Instead of deriving the dihedral group $D_n$ from the regular polygon {n},
we could have derived the polygon from the group: the vertices of the
polygon are just the n images of a point $P_0$ (the C of Figure 2.7a) on one of the
two mirrors of the kaleidoscope. In fact, there is no need to use the whole
group $D_n$: its subgroup $C_n$ will suffice. The vertex $P_k$ of the polygon
$P_0P_1 \ldots P_{n-1}$ can be derived from the initial vertex $P_0$ by a rotation through
k times 360°/n.

More generally, rotations about a fixed point O through angles θ, 2θ, 3θ,
... transform any point $P_0$ (distinct from O) into other points $P_1, P_2, P_3,$
... on the circle with center O and radius $OP_0$. In general, these points
become increasingly dense on the circle; but if the angle θ is commensurable
with a right angle, only a finite number of them will be distinct. In
particular, if θ = 360°/n, where n is a positive integer greater than 2, then there
will be n points $P_k$ whose successive joins


$P_0P_1, P_1P_2, \ldots, P_{n-1}P_0$


are the sides of an ordinary regular n-gon. 

Let us now extend this notion by allowing n to be any rational number
greater than 2, say the fraction p/d (where p and d are coprime). Accordingly,
we define a (generalized) regular polygon {n}, where n = p/d. Its p
vertices are derived from $P_0$ by repeated rotations through 360°/n, and its
p sides (enclosing the center d times) are


$P_0P_1, P_1P_2, \ldots, P_{p-1}P_0$.


Since a ray coming out from the center without passing through a vertex
will cross d of the p sides, this denominator d is called the density of the
polygon [Coxeter 1, pp. 93-94]. When d = 1, so that n = p, we have the
ordinary regular p-gon, {p}. When d > 1, the sides cross one another, but
the crossing points are not counted as vertices. Since d may be any positive
integer relatively prime to p and less than ½p, there is a regular polygon
{n} for each rational number n > 2. In fact, it is occasionally desirable
to include also the digon {2}, although its two sides coincide.
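
The construction of {p/d} can be mimicked with integer arithmetic. In the following sketch (an illustration only) the p points of the ordinary p-gon are assumed to be numbered 0 to p − 1 round the circle, so that a rotation through 360°/n = d · 360°/p advances the label by d; the function names `star_polygon_order` and `densities` are hypothetical.

```python
from math import gcd


def star_polygon_order(p, d):
    """Labels of the ordinary p-gon in the order P_0, P_1, ... of {p/d}."""
    if gcd(p, d) != 1:
        raise ValueError("p and d must be coprime")
    return [(k * d) % p for k in range(p)]


def densities(p):
    """Admissible densities: positive d coprime to p and less than p/2."""
    return [d for d in range(1, (p + 1) // 2) if gcd(p, d) == 1]


print(star_polygon_order(5, 2))  # [0, 2, 4, 1, 3]
print(densities(8))              # [1, 3]
print(densities(12))             # [1, 5]
```

For p = 5 the sequence 0, 2, 4, 1, 3 traces a five-pointed star, and the density lists agree with the polygons {8/3} and {12/5} that share their vertices with {8} and {12}.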

When p = 5, we have the pentagon {5} of density 1 and the pentagram
{5/2} of density 2, which was used as a special symbol by the Babylonians and
by the Pythagoreans. Similarly, the octagram {8/3} and the decagram {10/3}
have density 3, while the dodecagram {12/5} has density 5 (Figure 2.8a). These
particular polygons have names as well as symbols because they occur as
faces of interesting polyhedra and tessellations.*


Figure 2.8a


Polygons for which d > 1 are known as star polygons. They are frequently
used in decoration. The earliest mathematical discussion of them
was by Thomas Bradwardine (1290-1349), who became archbishop of Canterbury
for the last month of his life. They were also studied by the great
German scientist Kepler (1571-1630) [see Coxeter 1, p. 114]. It was the
Swiss mathematician L. Schläfli (1814-1895) who first used a numerical
symbol such as {p/d}. This notation is justified by the occurrence of formulas
that hold for {n} equally well whether n be an integer or a fraction.
For instance, any side of {n} forms with the center O an isosceles triangle
$OP_0P_1$ (Figure 2.8b) whose angle at O is 2π/n. (As we are introducing
trigonometrical ideas, it is natural to use radian measure and write 2π
instead of 360°.) The base of this isosceles triangle, being a side of the
polygon, is conveniently denoted by 2l. The other sides of the triangle are equal
to the circumradius R of the polygon. The altitude or median from O is
the inradius r of the polygon. Hence


2.81 $R = l \csc \frac{\pi}{n}, \quad r = l \cot \frac{\pi}{n}$.


If n = p/d, the area of the polygon is naturally defined to be the sum of
the areas of the p isosceles triangles, namely


2.82 $plr = pl^2 \cot \frac{\pi}{n}$.


Figure 2.8b


When d = 1, this is simply $pl^2 \cot \pi/p$; in other cases our definition of area
has the effect that every part of the interior is counted a number of times
equal to the “local density” of that part; for example, the pentagonal
region in the middle of the pentagram {5/2} is counted twice.


*H. S. M. Coxeter, M. S. Longuet-Higgins, and J. C. P. Miller, Uniform polyhedra,
Philosophical Transactions of the Royal Society, A, 246 (1954), pp. 401-450.

The angle $P_0P_1P_2$ between two adjacent sides of {n}, being the sum of
the base angles of the isosceles triangle, is the supplement of 2π/n, namely

2.83 $\left(1 - \frac{2}{n}\right)\pi$.

The line segment joining the midpoints of two adjacent sides is called the
vertex figure of {n}. Its length is clearly

2.84 $2l \cos \frac{\pi}{n}$

[Coxeter 1, pp. 16, 94].
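
Formulas 2.81 to 2.84 are easy to evaluate for particular polygons. The sketch below is an illustration under stated assumptions: the polygon {p/d} is described by the integers p and d, its side is 2l with l defaulting to 1, and the function name `polygon_metrics` and the dictionary keys are hypothetical.

```python
import math


def polygon_metrics(p, d=1, l=1.0):
    """Evaluate 2.81-2.84 for the regular polygon {p/d} with side 2l."""
    n = p / d
    half_angle = math.pi / n                 # half the angle 2*pi/n at the center
    R = l / math.sin(half_angle)             # 2.81: R = l csc(pi/n)
    r = l / math.tan(half_angle)             # 2.81: r = l cot(pi/n)
    return {
        "R": R,
        "r": r,
        "area": p * l * r,                   # 2.82: p l r = p l^2 cot(pi/n)
        "angle": (1 - 2 / n) * math.pi,      # 2.83
        "vertex_figure": 2 * l * math.cos(half_angle),  # 2.84
    }


square = polygon_metrics(4)
pentagram = polygon_metrics(5, 2)
print(square["area"])                      # close to 4, a square of side 2
print(math.degrees(pentagram["angle"]))    # close to 36 degrees
```

For the square with l = 1 this gives the familiar area 4 and circumradius √2, and for the pentagram {5/2} the angle at each point comes out as 36°.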

EXERCISES 


1. If the sides of a polygon inscribed in a circle are all equal, the polygon is regular.

2. If a polygon inscribed in a circle has an odd number of vertices, and all its
angles are equal, the polygon is regular. (Marcel Riesz.)

3. Find the angles of the polygons


{5}, {5/2}, {9}, {9/2}, {9/4}.

4. Find the radii and vertex figures of the polygons

{8}, {8/3}, {12}, {12/5}.


5. Give polar coordinates for the kth vertex $P_k$ of a polygon {n} of circumradius
1 with its center at the pole, taking $P_0$ to be (1, 0).


6. Can a square cake be cut into nine slices so that everyone gets the same amount of
cake and the same amount of icing?


3 


Isometry in the Euclidean plane 


Having made some use of reflections, rotations, and translations, we nat- 
urally ask why a rotation or a translation can be achieved as a continuous 
displacement (or “motion”) while a reflection cannot. It is also reasonable 
to ask whether there is any other kind of isometry that resembles a reflec- 
tion in this respect. After answering these questions in terms of “sense,” 
we shall use the information to prove a remarkable theorem (§ 3.6) and to 
describe the seven possible ways to repeat a pattern on an endless strip
(§ 3.7).


3.1 DIRECT AND OPPOSITE ISOMETRIES 


“Take care of the sense, and the sounds will take care of themselves.”


Lewis Carroll


[Dodgson 1, Chap. 9]


By several applications of Axiom 1.26, it can be proved that any point P
in the plane of two congruent triangles ABC, A’B’C’ determines a
corresponding point P’ such that AP = A’P’, BP = B’P’, CP = C’P’. Likewise
another point Q yields Q’, and PQ = P’Q’. Hence


3.11 Any two congruent triangles are related by a unique isometry. 


In § 1.3, we saw that Pappus’s proof of Pons asinorum involved the com- 
parison of two coincident triangles ABC, ACB. We see intuitively that this 
is a distinction of sense: if one is counterclockwise the other is clockwise. 
It is a “topological” property of the Euclidean plane that this distinction 
can be extended from coincident triangles to distinct triangles: any two 
“directed” triangles, ABC and A’B’C’, either agree or disagree in sense. 
(For a deeper investigation of this intuitive idea, see Veblen and Young [2,
pp. 61-62] or Denk and Hofmann [1, p. 56].)

If ABC and A’B’C’ are congruent, the isometry that relates them is said
to be direct or opposite according as it preserves or reverses sense, that is,
according as ABC and A’B’C’ agree or disagree. It is easily seen that this
property of the isometry is independent of the chosen triangle ABC: if the 
same isometry relates DEF to D’E’F’, where DEF agrees with ABC, then 
also D’E’F’ agrees with A’B’C’. Clearly, direct and opposite isometries 
combine like positive and negative numbers (e.g., the product of two oppo- 
site isometries is direct). Since a reflection is opposite, a rotation (which is 
the product of two reflections) is direct. In particular, the identity is direct. 
Some authors call direct and opposite isometries “displacements and re- 
versals” or “proper and improper congruences.” 
Theorem 2.31 can be extended as follows: 


Figure 3.1a 


3.12 Two given congruent line segments (or point pairs) AB, A’B’ are re- 
lated by just two isometries: one direct and one opposite. 


To prove this, take any point C outside the line AB, and construct C’ so 
that the triangle A’B’C’ is congruent to ABC. The two possible positions 
of C’ (marked C’, C” in Figure 3.1a) provide the two isometries. Since either 
can be derived from the other by reflecting in A’B’, one of the isometries is 
direct and the other opposite. 

For a complete discussion we need the following theorem [Bachmann 1,
p. 3]:


Figure 3.1b


3.13 Every isometry of the plane is the product of at most three reflections. 
If there is an invariant point, “three” can be replaced by “two.” 


We prove this in four stages, using 3.11. Trivially, if the triangles ABC,
A’B’C’ coincide, the isometry is the identity (which is the product of a
reflection with itself). If A coincides with A’, and B with B’, while C and C’
are distinct, the triangles are related by the reflection in AB. The case when
only A coincides with A’ can be reduced to one of the previous cases by
reflecting ABC in m, the perpendicular bisector of BB’ (see Figure 3.1b).
Finally, the general case can be reduced to one of the first three cases by
reflecting ABC in the perpendicular bisector of AA’ [Coxeter 1, p. 35].


Since a reflection reverses sense, an isometry is direct or opposite accord- 
ing as it is the product of an even or odd number of reflections. 

Since the identity is the product of two reflections (namely of any reflec- 
tion with itself), we may say simply that any isometry is the product of two 
or three reflections, according as it is direct or opposite. In particular, 


3.14 Any isometry with an invariant point is a rotation or a reflection ac- 
cording as it is direct or opposite. 


EXERCISES 


1. Name two direct isometries. 

2. Name one opposite isometry. Is there any other kind? 

3. If AB and A’B’ are related by a rotation, how can the center of rotation be con- 
structed? (Hint: The perpendicular bisectors of AA’ and BB’ are not necessarily dis- 
tinct.) 

4. The product of reflections in three lines through a point is the reflection in an- 
other line through the same point [Bachmann 1, p. 5]. 


3.2 TRANSLATION 


Enoch walked with God; and he was not, for God took him.
Genesis V, 24


The particular isometries so far considered, namely reflections (which are
opposite) and rotations (which are direct), have each at least one invariant
point. A familiar isometry that leaves no point invariant is a translation
[Bachmann 1, p. 7], which may be described as the product of half-turns
about two distinct points O, O’ (Figure 3.2a). The first half-turn
transforms an arbitrary point P into $P^H$, and the second transforms this into $P^T$,
with the final result that $PP^T$ is parallel to OO’ and twice as long. Thus the
length and direction of $PP^T$ are constant: independent of the position of P.
Since a translation is completely determined by its length and direction,
the product of half-turns about O and O’ is the same as the product of
half-turns about Q and Q’, provided QQ’ is equal and parallel to OO’. (This
means that OO’Q’Q is a parallelogram, possibly collapsing to form four
collinear points, as in Figure 3.2a.) Thus, for a given translation, the
center of one of the two half-turns may be arbitrarily assigned.


3.21 The product of two translations is a translation. 


For, we may arrange the centers so that the first translation is the
product of half-turns about $O_1$ and $O_2$, while the second is the product of
half-turns about $O_2$ and $O_3$. When they are combined, the two half-turns about
$O_2$ cancel, and we are left with the product of half-turns about $O_1$ and $O_3$.


Figure 3.2a Figure 3.2b 


Similarly, if m and m’ (Figure 3.2b) are the lines through O and O’
perpendicular to OO’, the half-turns about O and O’ are the products of
reflections in m and OO’, OO’ and m’. When they are combined, the two
reflections in OO’ cancel, and we are left with the product of reflections in
m and m’. Hence

3.22 The product of reflections in two parallel mirrors is a translation 
through twice the distance between the mirrors. 
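
Both descriptions of a translation, as a product of two half-turns and as a product of reflections in two parallel mirrors, can be checked on sample points. The following sketch is an illustration only: the centers (1, 2) and (4, 6) and the vertical mirrors x = 1 and x = 4 are arbitrary choices, and the helper names `half_turn`, `reflect_in_vertical_line` and `then` are hypothetical.

```python
def half_turn(center):
    """Half-turn about the given center: P goes to 2*center - P."""
    cx, cy = center
    return lambda p: (2 * cx - p[0], 2 * cy - p[1])


def reflect_in_vertical_line(c):
    """Reflection in the mirror x = c."""
    return lambda p: (2 * c - p[0], p[1])


def then(first, second):
    """Product 'first, then second', read from left to right as in the text."""
    return lambda p: second(first(p))


samples = [(0, 0), (3, -5), (7, 7)]

# Half-turns about O = (1, 2) and O' = (4, 6): every point moves by 2(O' - O).
T = then(half_turn((1, 2)), half_turn((4, 6)))
shifts = {(T(p)[0] - p[0], T(p)[1] - p[1]) for p in samples}

# Mirrors x = 1 and x = 4 are 3 apart: every point moves 6 units along the x-axis.
M = then(reflect_in_vertical_line(1), reflect_in_vertical_line(4))
mirror_shifts = {(M(p)[0] - p[0], M(p)[1] - p[1]) for p in samples}

print(shifts)         # {(6, 8)}
print(mirror_shifts)  # {(6, 0)}
```

Each set contains a single displacement, which is what it means for the product to be a translation: the length and direction do not depend on the point chosen.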

If a translation T takes P to $P^T$ and Q to $Q^T$, the segment $QQ^T$ is equal
and parallel to $PP^T$; therefore $PQQ^TP^T$ is a parallelogram. Similarly, if
another translation U takes P to Q, it also takes $P^T$ to $Q^T$; therefore


TU = UT.

(In detail, if Q is $P^U$, $Q^T$ is $P^{UT}$. But U takes $P^T$ to $P^{TU}$. Therefore $P^{TU}$
and $P^{UT}$ coincide, for all positions of P.) In other words,

3.23 Translations are commutative. 

The product of a half-turn H and a translation T is another half-turn; for 
we can express the translation as the product of two half-turns, one of which 
is H, say T = HH’, and then we have 

$HT = H^2H' = H'$:

3.24 The product of a half-turn and a translation is a half-turn.


EXERCISES


1. If T is the product of half-turns about O and O’, what is the product of
half-turns about O’ and O?

2. When a translation is expressed as the product of two reflections, to what
extent can one of the two mirrors be arbitrarily assigned?

3. What is the product of rotations through opposite angles (α and −α) about two
distinct points?

4. The product of reflections in three parallel lines is the reflection in another line
belonging to the same pencil of parallels.

5. Every product of three half-turns is a half-turn [Bachmann 1, p. 7].

6. If $H_1$, $H_2$, $H_3$ are half-turns, $H_1H_2H_3 = H_3H_2H_1$.

7. Express the translation through distance a along the x-axis as a transformation
of Cartesian coordinates. If f(x, y) = 0 is the equation for a curve, what is the
equation for the transformed curve? Consider, for instance, the circle
$x^2 + y^2 - 1 = 0$.


3.3 GLIDE REFLECTION 


We are now familiar with three kinds of isometry: reflection, rotation, 
and translation. Another kind is the glide reflection (or simply “glide”), 
which is the product of the reflection in a line a and a translation along the 
same line. Picture this line as a straight path through snow; then, consecu- 
tive footprints are related by a glide. Such an isometry is determined by its 
axis a and the extent of the component translation. Since a reflection is op- 
posite whereas a translation is direct, their product is opposite. Thus a 
glide reflection is an opposite isometry having no invariant point [Coxeter 1, 
p. 36]. 

If a glide reflection G transforms an arbitrary point P into $P^G$ (Figure 3.3a),
P and $P^G$ are equidistant from the axis a on opposite sides. Hence


The midpoint of the line segment $PP^G$ lies on the axis for all positions of P.
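
This midpoint property can be verified exactly in coordinates. The sketch below is an illustration under stated assumptions: the axis a is taken to be the x-axis, the component translation has length 3, and the helper name `glide` is hypothetical; rational arithmetic is used so that the comparison with zero is exact.

```python
from fractions import Fraction


def glide(a):
    """Glide reflection with the x-axis as axis and translation length a."""
    return lambda p: (p[0] + a, -p[1])


G = glide(Fraction(3))
points = [
    (Fraction(0), Fraction(5)),
    (Fraction(-2), Fraction(1, 3)),
    (Fraction(7), Fraction(-4)),
]

# Midpoint of each segment joining P to its image under G.
midpoints = [((p[0] + G(p)[0]) / 2, (p[1] + G(p)[1]) / 2) for p in points]
all_on_axis = all(m[1] == 0 for m in midpoints)
print(all_on_axis)  # True
```

Applying G twice gives (x + 6, y), a translation along the axis through twice the length of the component translation.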


Figure 3.3a Figure 3.3b


Let $R_1$ and T denote the component reflection and translation. They
evidently commute, so that


$G = R_1T = TR_1$.


We have seen (Figure 3.2b) that the translation T may be expressed as the
product of two half-turns or of two parallel reflections. Identifying the line
a in Figure 3.3a with the line OO’ in Figure 3.3b, let R, R’ denote the
reflections in m, m’. Then the product of the two half-turns


$H = RR_1 = R_1R$, $H' = R'R_1 = R_1R'$

is $T = HH' = RR_1R_1R' = RR'$,

and the glide reflection is

$G = R_1T = R_1RR' = HR'$
$= TR_1 = RR'R_1 = RH'$.


Thus a glide reflection may be expressed as the product of three reflections 
(two perpendicular to the third), or of a half-turn and a reflection, or of a 
reflection and a half-turn. Conversely, the product of any half-turn and 
any reflection (or vice versa) is a glide reflection, provided the center of the 
half-turn does not lie on the mirror. [Bachmann 1, p. 6.] 

We saw in 3.13 that any direct isometry in the plane is the product of two 
reflections, that is, a translation or a rotation according as the two mirrors 
are parallel or intersecting; also that any opposite isometry with an invari- 
ant point is a reflection. To complete the catalog of isometries, the only 
remaining possibility is an opposite isometry with no invariant point. If 
such an isometry S transforms an arbitrary point A into A’, consider the 
half-turn H that interchanges these two points. The product HS, being an 
opposite isometry which leaves the point A’ invariant, can only be a reflec- 
tion R. Hence the given opposite isometry is the glide reflection 


$S = H^{-1}R = HR$:


Every opposite isometry with no invariant point is a glide reflection. 


In other words, 


3.31 Every product of three reflections is either a single reflection or a 
glide reflection. 


In particular, the product RT of any reflection and any translation is a 
glide reflection, degenerating to a pure reflection when the mirror for R is 
perpendicular to the direction of the translation T (in which case the reflec- 
tions R and RT may be used as the two parallel reflections whose product 
is T). But since a given glide reflection G has a definite axis (the locus of
midpoints of segments $PP^G$), its decomposition into a reflection and a
translation along the mirror is unique (unlike its decomposition into a reflection
and a half-turn, where we may either take the mirror to be any line per- 
pendicular to the axis or equivalently take the center of the half-turn to be 
any point on the axis). 


EXERCISES 
1. If B is the midpoint of AC, what kinds of isometry will transform
(i) AB into CB, (ii) AB into BC?


2. Every direct isometry is the product of two reflections. Every opposite isometry 
is the product of a reflection and a half-turn. 

3. Describe the product of the reflection in OO’ and the half-turn about O. 

4. Describe the product of two glide reflections whose axes are perpendicular.

5. Every product of three glide reflections is a reflection or a glide reflection.


6. The product of three reflections is a reflection if and only if the three mirrors 
are either concurrent or parallel. 


7. If $R_1$, $R_2$, $R_3$ are three reflections, $(R_1R_2R_3)^2$ is a translation [Rademacher and
Toeplitz 1, p. 29].

8. Describe the transformation


$(x, y) \to (x + a, -y)$.

Justify the statement that this transforms the curve f(x, y) = 0 into f(x − a, −y) = 0.


3.4 REFLECTIONS AND HALF-TURNS 


Thomsen* has developed a very beautiful theory in which geometrical
properties of points $O, O_1, O_2, \ldots$ and lines $m, m_1, m_2, \ldots$ (understood to be
all distinct) are expressed as relations among the corresponding half-turns
$H, H_1, H_2, \ldots$ and reflections $R, R_1, R_2, \ldots$. The reader can soon convince
himself that the following pairs of statements are logically equivalent:


$RR_1 = R_1R$ ⟺ m and $m_1$ are perpendicular.

$HR = RH$ ⟺ O lies on m.

$R_1R_2R_3 = R_3R_2R_1$ ⟺ $m_1$, $m_2$, $m_3$ are either concurrent or parallel.

$H_1H = HH_2$ ⟺ O is the midpoint of $O_1O_2$.

$H_1R = RH_2$ ⟺ m is the perpendicular bisector of $O_1O_2$.
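
Two of these equivalences can be tried out on concrete points and lines. The sketch below is an illustration only: the mirror m is assumed to be the horizontal line y = 3, the centers are arbitrary integer points, and the helper names `half_turn`, `reflect_in_horizontal`, `then` and `same` are hypothetical. Since all the maps involved are affine, agreement on three non-collinear sample points is enough to establish equality.

```python
def half_turn(o):
    """Half-turn about the point o."""
    return lambda p: (2 * o[0] - p[0], 2 * o[1] - p[1])


def reflect_in_horizontal(c):
    """Reflection in the mirror y = c."""
    return lambda p: (p[0], 2 * c - p[1])


def then(first, second):
    """Product 'first, then second', read from left to right."""
    return lambda p: second(first(p))


SAMPLES = [(0, 0), (1, 0), (0, 1)]  # three non-collinear points


def same(f, g):
    """True when the two affine maps agree on the sample points."""
    return all(f(p) == g(p) for p in SAMPLES)


R = reflect_in_horizontal(3)
H_on, H_off = half_turn((5, 3)), half_turn((5, 4))
commutes_on = same(then(H_on, R), then(R, H_on))     # O = (5, 3) lies on m
commutes_off = same(then(H_off, R), then(R, H_off))  # O = (5, 4) does not

# O = (2, 1) is the midpoint of O_1 = (0, 0) and O_2 = (4, 2).
H1, H, H2 = half_turn((0, 0)), half_turn((2, 1)), half_turn((4, 2))
midpoint_relation = same(then(H1, H), then(H, H2))

print(commutes_on, commutes_off, midpoint_relation)  # True False True
```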
EXERCISE 


Interpret the relations (a) $H_1H_2H_3H_4 = 1$; (b) $R_1R = RR_2$.


3.5 SUMMARY OF RESULTS ON ISOMETRIES 


And thick and fast they came at last, 
And more, and more, and more.
Lewis Carroll

[Dodgson 2, Chap. 4] 


Some readers may have become confused with the abundance of technical
terms, many of which are familiar words to which unusually precise
meanings have been attached. Accordingly, let us repeat some of the
definitions, stressing both their analogies and their differences.


* G. Thomsen, The treatment of elementary geometry by a group-calculus, Mathematical
Gazette, 17 (1933), p. 232. Bachmann [1] devotes a whole book to the development of this idea.


In all the contexts that concern us here, a transformation is a one-to-one
correspondence of the whole plane (or space) with itself. An isometry is a 
special kind of transformation, namely, the kind that preserves length. A 
symmetry operation belongs to a given figure rather than to the whole plane: 
it is an isometry that transforms the figure into itself. 

In the plane, a direct (sense-preserving) isometry, being the product of 
two reflections, is a rotation or a translation according as it does or does 
not have an invariant point, that is, according as the two mirrors are inter- 
secting or parallel. In the latter case the length of the translation is twice 
the distance between the mirrors; in the former, the angle of the rotation 
is twice the angle between the mirrors. In particular, the product of reflec- 
tions in two perpendicular mirrors is a half-turn, that is, a rotation through 
two right angles. Moreover, the product of two half-turns is a translation. 

An opposite (sense-reversing) isometry, being the product of three reflec- 
tions, is, in general, a glide reflection: the product of a reflection and a transla- 
tion. In the special case when the translation is the identity (i.e., a translation 
through zero distance), the glide reflection reduces to a single reflection, 
which has a whole line of invariant points, namely, all the points on the 
mirror. 

To sum up:


3.51 Any direct isometry is either a translation or a rotation. Any oppo- 
site isometry is either a reflection or a glide reflection. 
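
The four-way classification in 3.51 translates directly into a small decision procedure. The sketch below is an illustration under stated assumptions: the isometry is given in Cartesian coordinates as x → Ax + b with A an orthogonal 2 × 2 matrix (this representation and the function name `classify` are not from the text), and the identity is reported as a translation through zero distance.

```python
import math


def classify(A, b, tol=1e-9):
    """Classify the plane isometry x -> A x + b, with A assumed orthogonal."""
    (a11, a12), (a21, a22) = A
    det = a11 * a22 - a12 * a21
    if det > 0:
        # Direct isometry: a translation when A is the identity matrix,
        # otherwise a rotation (which has an invariant point).
        if abs(a11 - 1) < tol and abs(a21) < tol:
            return "translation"
        return "rotation"
    # Opposite isometry: (b + A b) / 2 is the component of b along the mirror,
    # that is, the translation part of the glide; it vanishes for a reflection.
    gx = (b[0] + a11 * b[0] + a12 * b[1]) / 2
    gy = (b[1] + a21 * b[0] + a22 * b[1]) / 2
    return "reflection" if math.hypot(gx, gy) < tol else "glide reflection"


identity = ((1, 0), (0, 1))
quarter_turn = ((0, -1), (1, 0))
flip_y = ((1, 0), (0, -1))          # reflection in the x-axis

print(classify(identity, (2, 3)))       # translation
print(classify(quarter_turn, (5, 5)))   # rotation
print(classify(flip_y, (0, 4)))         # reflection (mirror y = 2)
print(classify(flip_y, (3, 0)))         # glide reflection
```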


EXERCISES 


1. If S is an opposite isometry, $S^2$ is a translation.

2. If $R_1$, $R_2$, $R_3$ are three reflections, $(R_2R_3R_1R_2R_3)^2$ is a translation along the first
mirror. (Hint: Since $R_1R_2R_3$ and $R_2R_3R_1$ are glide reflections, their squares are
commutative, by 3.23; thus


$(R_1R_2R_3)^2(R_2R_3R_1)^2 = (R_2R_3R_1)^2(R_1R_2R_3)^2$,

that is, $R_1$ and $(R_2R_3R_1R_2R_3)^2$ are commutative [cf. Bachmann 1, p. 13].)


3.6 HJELMSLEV’S THEOREM 


... a very high degree of unexpectedness, combined with inevitability
and economy.


G.H. Hardy [2, p. 53] 


We saw, in 3.12, that two congruent line segments AB, A’B’, are related by 
just two isometries: one direct and one opposite. Both isometries have the 
same effect on every point collinear with A and B, that is, every point on 
the infinite straight line AB (for instance, the midpoint of AB is transformed 
into the midpoint of A’B’). The opposite isometry is a reflection or glide 
reflection whose mirror or axis contains all the midpoints of segments joining
pairs of corresponding points. If two of these midpoints coincide, the
direct isometry is a half-turn, and they all coincide [Coxeter 3, p. 267].
Hence 

HJELMSLEV’S THEOREM. When all the points P on one line are related by 
an isometry to all the points P’ on another, the midpoints of the segments PP’ 
are distinct and collinear or else they all coincide. 


Figure 3.6a


In particular, if A, B, C are on one line and A’, B’, C’ on another, with 
3.61     AB = A’B’,   BC = B’C’


(Figure 3.6a), then the midpoints of AA’, BB’, CC’ are either collinear or 
coincident (J. T. Hjelmslev, 1873-1950). 
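
Hjelmslev's theorem lends itself to a quick numerical check. The sketch below is illustrative only; it assumes the plane is modelled by complex numbers, with the points of the first line written as P = a + t·u and their images as P’ = a1 + t·u1, where u and u1 are unit vectors (the names `midpoints` and `collinear` are hypothetical helpers).

```python
import cmath

def midpoints(a, u, a1, u1, ts):
    """Midpoints of PP' where P = a + t*u and P' = a1 + t*u1 (|u| = |u1| = 1)."""
    return [((a + t * u) + (a1 + t * u1)) / 2 for t in ts]

def collinear(points, tol=1e-9):
    """True if all the complex points lie on one line (or all coincide)."""
    p0 = points[0]
    far = max(points, key=lambda p: abs(p - p0))   # fixes a direction
    d = far - p0
    if abs(d) < tol:
        return True
    # The cross product of (p - p0) with d vanishes for points on the line.
    return all(abs(((p - p0) * d.conjugate()).imag) < tol for p in points)

ts = [-2.0, -0.5, 0.0, 1.0, 3.0]
u = cmath.exp(0.3j)

# General case: the midpoints are distinct and collinear.
mids = midpoints(1 + 2j, u, -4 + 1j, cmath.exp(2.1j), ts)
assert collinear(mids) and abs(mids[0] - mids[-1]) > 0.1

# Half-turn case (u1 = -u): all the midpoints coincide.
same = midpoints(1 + 2j, u, -4 + 1j, -u, ts)
assert all(abs(m - same[0]) < 1e-12 for m in same)
```

The midpoint is (a + a1)/2 + t(u + u1)/2, which moves along a fixed direction as t varies unless u1 = -u.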


3.7 PATTERNS ON A STRIP


Figure 3.7a


Any kind of isometry may be used to relate two equal circles. For instance,
the point P on the first circle of Figure 3.7a is transformed into $P^T$
on the second circle by a translation, into $P^R$ by a reflection, into $P^H$ by a
half-turn, and into $P^G$ by a glide reflection. (Arrows have been inserted to
indicate what happens to the positive sense of rotation round the first cir- 
cle.) These four isometries have one important property in common: they 
leave invariant (as a whole) one infinite straight line, namely, the line 
joining the centers of the two circles. (In the fourth case this is the only in- 
variant line.) 

We have seen (Figure 3.2b) that the product of reflections in two parallel
mirrors m, m’ is a translation. This may be regarded as the limiting case
of a rotation whose center is very far away; for the two parallel mirrors are
the limiting case of two mirrors intersecting at a very small angle. Accordingly,
the infinite group generated by a single translation is denoted by $C_\infty$,
and the infinite group generated by two parallel reflections is denoted by
$D_\infty$. Abstractly, $C_\infty$ is the “free group with one generator.” If T is the
generating translation, the group consists of the translations


$\ldots,\ T^{-2},\ T^{-1},\ 1,\ T,\ T^{2},\ \ldots.$


m m’ 
RR’R R’R R 1 R’ RR’ 


Figure 3.7b 


Similarly, $D_\infty$, generated by the reflections R, R’ in parallel mirrors m, m’
(Figure 3.7b), consists of the reflections and translations


..., RR’R, R’R, R, 1, R’, RR’, R’RR’, ...

[Coxeter 1, p. 76]; its abstract definition is simply

$R^2 = R'^2 = 1.$


This group can be observed when we sit in a barber’s chair between two 
parallel mirrors (cf. the New Yorker, Feb. 23, 1957, p. 39, where somehow 
the reflection RR’RR’R yields a demon). 

A different geometrical representation for the same abstract group $D_\infty$
is obtained by interpreting the generators R and R’ as half-turns. There is
also an intermediate representation in which one of them is a reflection and 
the other a half-turn; but in this case their product is no longer a transla- 
tion but a glide reflection. 

Continuing in this manner, we could soon obtain the complete list of the 
seven infinite “one-dimensional” symmetry groups: the seven essentially
distinct ways to repeat a pattern on a strip or ribbon [Speiser 1, pp. 81-82]: 


      Typical pattern        Generators                        Abstract group

(i)   ... L L L L ...        1 translation                     $C_\infty$
(ii)  ... L Γ L Γ ...        1 glide reflection                $C_\infty$

(iii) ... V V V V ...        2 reflections                     $D_\infty$
(iv)  ... N N N N ...        2 half-turns                      $D_\infty$
(v)   ... V Λ V Λ ...        1 reflection and 1 half-turn      $D_\infty$

(vi)  ... D D D D ...        1 translation and 1 reflection    $C_\infty \times D_1$
(vii) ... H H H H ...        3 reflections                     $D_\infty \times D_1$

In (iii), the two mirrors are both vertical, one in the middle of a V, reflecting
it into itself, while the other reflects this V into one of its neighbors; thus
one half of the V, placed between the two mirrors, yields the whole pattern.
In (vi) and (vii) there is a horizontal mirror, and the symbols in the last
column indicate “direct products” [Coxeter 1, p. 42]. For all these groups,
except (i) and (ii), there is some freedom in choosing the generators; for example,
in (iii) or (iv) one of the two generators could be replaced by a translation.

Strictly speaking, these seven groups are not “1-dimensional” but “1½-dimensional;”
that is, they are 2-dimensional symmetry groups involving
translation in one direction. In a purely one-dimensional world there are
only two infinite symmetry groups: $C_\infty$, generated by one translation, and
$D_\infty$, generated by two reflections (in point mirrors).


EXERCISES 
1. Identify the symmetry groups of the following patterns:

...b b b b...,
...b p b p...,
...b d b d...,
...b q b q...,
...b d p q b d p q...

2. Which are the symmetry groups of (a) a cycloid, (b) a sine curve?


Two-dimensional crystallography


Mathematical crystallography provides one of the most important appli- 
cations of elementary geometry to physics. The three-dimensional theory is 
complicated, but its analog in two dimensions is easy to visualize without 
being trivial. Patterns covering the plane arise naturally as an extension 
of the strip patterns considered in §3.7. However, in spite of the restric- 
tion to two dimensions, a complete account of the enumeration of infinite 
symmetry groups is beyond the scope of this book. 


4.1 LATTICES AND THEIR DIRICHLET REGIONS 


For some minutes Alice stood without speaking, looking out in all direc- 
tions over the country ... “I declare it's marked out just like a large
chessboard ... all over the world—if this is the world at all.”


Lewis Carroll 


[Dodgson 2, Chap. 2] 


Infinite two-dimensional groups (the symmetry groups of repeating pat- 
terns such as those commonly used on wallpaper or on tiled floors) are dis- 
tinguished from infinite “one-dimensional” groups by the presence of 
independent translations, that is, translations whose directions are neither 
parallel nor opposite. The crystallographer E. S. Fedorov showed that there 
are just seventeen such two-dimensional groups of isometries. They were rediscovered
in our own century by Pólya and Niggli.* The symbols by which
we denote them are taken from the International Tables for X-ray Crystal- 
lography. 

The simplest instance is the group p1, generated by two independent 


* E. S. Fedorov, Zapiski Imperatorskogo S. Peterburgskogo Mineralogicheskogo Obshchestva 
(2), 28 (1891), pp. 345-390; G. Pólya and P. Niggli, Zeitschrift für Kristallographie und Mineralogie,
60 (1924), pp. 278-298. [See also Fricke and Klein 1, pp. 227-233.] Fedorov’s table shows
that 16 of the 17 groups had been described by C. Jordan in 1869. The remaining one was rec- 
ognized by L. Sohncke in 1874; but he missed three others. 

Figure 4.1a


translations X, Y. Since the inverse of a translation is a translation, and the
product of two translations is a translation (3.21), this group consists entirely
of translations. Since XY = YX, these translations are simply $X^x Y^y$ for all
integers x, y. Abstractly, this is the “direct product” $C_\infty \times C_\infty$, which has
the single defining relation


XY = YX 


[Coxeter and Moser 1, p. 40]. Any object, such as the numeral 6 in Figure
4.1a, is transformed by the group p1 into an infinite array of such objects,
forming a pattern. Conversely, p1 is the complete symmetry group of the
pattern, provided the object has no intrinsic symmetry. If the object is a single
point, the pattern is an array of points called a two-dimensional lattice,
which may be pictured as the plan of an infinite orchard. Each lattice point
is naturally associated with the symbol for the translation by which it is
derived from the original point 1 (Figure 4.1b).


Figure 4.1b


Anyone standing in an orchard observes the alignment of trees in rows in 
many directions. This exhibits a characteristic property of a lattice: the line 
joining any two of the points contains infinitely many of them, evenly spaced, 
that is, a “one-dimensional lattice.” In fact, the line joining the points 1 
and $X^x Y^y$ contains also the points


$X^{nx/d}\, Y^{ny/d} = (X^{x/d}\, Y^{y/d})^n,$

where d is the greatest common divisor of x and y, and n runs over all the 
integers. In particular, the powers of X all lie on one line, the powers of 
Y on another, and lines parallel to these through the remaining lattice points 
form a tessellation of congruent parallelograms filling the plane without
interstices (Figure 4.1c). (We use the term tessellation for any arrangement
of polygons fitting together so as to cover the whole plane without overlapping.)


Figure 4.1c
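
The role of the greatest common divisor can be made concrete with a few lines of Python. This is only a sketch: each lattice point $X^a Y^b$ is represented by its pair of exponents (a, b), and the function name `lattice_points_on_join` is a hypothetical helper.

```python
from math import gcd

def lattice_points_on_join(x, y, n_range):
    """Exponent pairs (a, b) of the lattice points X^a Y^b on the line
    joining the point 1 to X^x Y^y, for (x, y) other than (0, 0)."""
    d = gcd(x, y)
    step = (x // d, y // d)      # exponents of the lattice point nearest to 1
    return [(n * step[0], n * step[1]) for n in n_range]

# On the line from 1 to X^4 Y^6 the points are the powers of X^2 Y^3.
print(lattice_points_on_join(4, 6, range(-1, 4)))
```

Here d = 2, so consecutive points of this "one-dimensional lattice" differ by the translation $X^2 Y^3$.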


A typical parallelogram is formed by the four points 1, X, XY, Y. The 
translation T = $X^x Y^y$ transforms this parallelogram into another one having
the point T (instead of 1) at its “first” corner. There is thus a one-to-one
correspondence between the cells or tiles of the tessellation and the
transformations in the group, with the property that each transformation 
takes any point inside the original cell to a point similarly situated in the new 
cell. For this reason, the typical parallelogram is called a fundamental region. 

The shape of the fundamental region is far from unique. Any parallelogram
will serve, provided it has four lattice points for its vertices but no
others on its boundary or inside [Hardy and Wright 1, p. 28]. This is the
geometrical counterpart of the algebraic statement that the group generated
by X, Y is equally well generated by $X^a Y^b$, $X^c Y^d$, provided


$ad - bc = 1.$

To express the old generators in terms of the new, we observe that

$(X^a Y^b)^d (X^c Y^d)^{-b} = X^{ad-bc}, \qquad (X^a Y^b)^{-c} (X^c Y^d)^a = Y^{ad-bc}.$


But there is no need for the fundamental region to be a parallelogram at all; 
for example, we may replace each pair of opposite sides by a pair of con- 
gruent curves, as in Figure 4.1d. 


Figure 4.1d
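
The change of generators can be verified mechanically. In the following sketch (an illustration with hypothetical names), U and V stand for the new generators $X^a Y^b$ and $X^c Y^d$, and every translation is represented by its pair of exponents; since all translations commute, exponents simply add. The sketch also accepts ad - bc = -1, for which the same two identities yield the inverses of X and Y.

```python
def old_generators(a, b, c, d):
    """Return ((p, q), (r, s)) with X = U^p V^q and Y = U^r V^s,
    where U = X^a Y^b and V = X^c Y^d and ad - bc is 1 or -1."""
    det = a * d - b * c
    if det not in (1, -1):
        raise ValueError("U and V do not generate the whole group")
    # U^d V^(-b) = X^det and U^(-c) V^a = Y^det; multiplying the
    # exponents by det (= +1 or -1) recovers X and Y themselves.
    return (d * det, -b * det), (-c * det, a * det)

def word(p, q, a, b, c, d):
    """Exponents (x, y) of U^p V^q written as X^x Y^y."""
    return (p * a + q * c, p * b + q * d)

a, b, c, d = 2, 1, 5, 3                      # ad - bc = 1
(px, qx), (py, qy) = old_generators(a, b, c, d)
assert word(px, qx, a, b, c, d) == (1, 0)    # recovers X
assert word(py, qy, a, b, c, d) == (0, 1)    # recovers Y
```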

Every possible fundamental region, whether we choose a parallelogram
or any other shape, has the same area as the typical parallelogram of Figure
4.1c. For, inside a sufficiently large circle, the number of lattice points is
equal to the number of replicas of any fundamental region (with an insignificant
error due to mutilated regions at the circumference); thus every possible
shape has for its area the same fraction of the area of the large circle.* It is
an interesting fact that any convex fundamental region for the translation
group is a centrally symmetrical polygon (namely, a parallelogram or a centrally
symmetrical hexagon).†

Among the various possible parallelograms, we can select a standard or 
reduced parallelogram by taking the generator Y to be the shortest trans- 
lation (or one of the shortest) in the group, and X to be an equal or next 
shortest translation in another direction. If the angle between X and Y then
happens to be obtuse, we reverse the direction of Y. Thus, among all the 
parallelograms that can serve as a fundamental region, the reduced paral- 
lelogram has the shortest possible sides. The translations along these sides 
are naturally called reduced generators. 
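
One way to find reduced generators in practice is the classical Lagrange-Gauss reduction. The sketch below uses that standard procedure rather than anything stated in the text; translations are represented by integer coordinate pairs, and the function names are hypothetical.

```python
def dot(u, v):
    return u[0] * v[0] + u[1] * v[1]

def reduce_generators(u, v):
    """Lagrange-Gauss reduction of two independent translation vectors.

    Returns (y, x): y is a shortest translation of the lattice, x a shortest
    translation in another direction, and the angle between them is not obtuse.
    """
    if dot(u, u) > dot(v, v):
        u, v = v, u
    while True:
        m = round(dot(u, v) / dot(u, u))       # nearest multiple of u to remove
        v = (v[0] - m * u[0], v[1] - m * u[1])
        if dot(v, v) >= dot(u, u):
            break
        u, v = v, u                            # v became shorter: swap and repeat
    if dot(u, v) < 0:                          # obtuse angle: reverse one generator
        v = (-v[0], -v[1])
    return u, v

# (3, 1) and (5, 2) generate the square lattice of all integer points.
y, x = reduce_generators((3, 1), (5, 2))
assert dot(y, y) == 1 and dot(x, x) == 1 and dot(x, y) >= 0
```

Each swap strictly shortens the first vector, so the loop terminates for any pair of independent translations.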


Figure 4.1e


* Gauss used this idea as a means of estimating π [Hilbert and Cohn-Vossen 1, pp. 33-34].
† A. M. Macbeath, Canadian Journal of Mathematics, 13 (1961), p. 177.


By joining the vertices X, Y of the reduced parallelogram, and the corresponding
pair of vertices of each replica, we obtain a tessellation of congruent
triangles whose vertices are lattice points and whose angles are
nonobtuse. Each lattice point belongs to six of the triangles; for example,
the triangles surrounding the point 1 join it to pairs of adjacent points in
the cycle


$X,\ Y,\ X^{-1}Y,\ X^{-1},\ Y^{-1},\ XY^{-1}$


(Figure 4.1e). By joining the circumcenters of these six triangles, we obtain
the Dirichlet region (or “Voronoi polygon”) of the lattice: a polygon whose
interior consists of all the points in the plane which are nearer to a particular
lattice point (such as the point 1) than to any other lattice point.* Such
regions, each surrounding a lattice point, evidently fit together to fill the
whole plane; in fact, the Dirichlet region is a particular kind of fundamental
region.

The lattice is symmetrical by the half-turn about the point 1 (or any other
lattice point). For this half-turn interchanges the pairs of lattice points
$X^x Y^y$, $X^{-x} Y^{-y}$. (In technical language, the group p1 has an automorphism of
period 2 which replaces X and Y by their inverses.) Hence the Dirichlet 
region is symmetrical by a half-turn. Its precise shape depends on the relative 
lengths of the generating translations X, Y and the angle between them. If 
this angle is a right angle, the Dirichlet region is a rectangle (or a square), 
since the circumcenter of a right-angled triangle is the midpoint of the 
hypotenuse. In all other cases it is a hexagon (not necessarily a regular hexa- 
gon; but since it is centrally symmetrical, its pairs of opposite sides are equal 
and parallel). 

Varying the lattice by letting the angle between the translations X and Y 
increase gradually to 90°, we see that two opposite sides of the hexagon 
shrink till they become single vertices, and then the remaining four sides 
form a rectangle (or square). 

Reflections in the four or six sides of the Dirichlet region transform the
central lattice point 1 into four or six other lattice points which we naturally
call the neighbors of the point 1.


EXERCISES 


1. Any two opposite sides of a Dirichlet region are perpendicular to the line joining 
their midpoints. 

2. Sketch the various types of lattice that can arise if X and Y are subject to the fol- 
lowing restrictions: they may have the same length, and the angle between them may be 
90° or 60°. Indicate the Dirichlet region in each case, and state whether the symmetry 
group of this region is $C_2$, $D_2$, $D_4$, or $D_6$.


4.2 THE SYMMETRY GROUP OF THE GENERAL LATTICE 


The investigation of the symmetries of a given mathematical structure 
has always yielded the most powerful results. 


E. Artin (1898-1962)
[Artin 1, p. 54] 


Any given lattice is easily seen to be symmetrical by the half-turn about the 
midpoint of the segment joining any two lattice points [Hilbert and Cohn- 
Vossen 1, p. 73]. Such midpoints form a lattice of finer mesh, whose gen- 
erating translations are half as long as X and Y (see the “open” points in 
Figure 4.2a). 

* G. L. Dirichlet, Journal für die reine und angewandte Mathematik, 40 (1850), pp. 216-219.

Figure 4.2a


The “general” lattice occurs when the reduced generators differ in length 
and the angle between them is neither 90° nor 60°. In such a case, the translations
$X^x Y^y$ and the above-mentioned half-turns are its only symmetry
operations. In other words, the symmetry group of the general lattice is
derived from p1 by adding an extra transformation H, which is the half-turn
about the point 1. This group is denoted by p2 [Coxeter and Moser 1, pp.
41-42]. It is generated by the half-turn H and the translations X, Y, in terms
of which the half-turn that interchanges the points 1 and T = $X^x Y^y$ is HT.
(Note that T itself is the product of H and HT.) The group is equally well
generated by the three half-turns HX, H, HY, or (redundantly) by these three
and their product


$HX \cdot H \cdot HY = HXY,$


which are half-turns about the four vertices of the parallelogram shown in 
Figure 4.2a. 
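
The relations among these half-turns can be confirmed by a short computation. The sketch below is illustrative: the plane is modelled by complex numbers with the lattice point 1 at the origin, each translation or half-turn z -> s·z + t (s = 1 or -1) is stored as the pair (s, t), and products are taken from left to right, which is assumed to be the convention of the text. All function names are hypothetical.

```python
def translation(t):
    """z -> z + t, stored as (1, t)."""
    return (1, t)

def half_turn(center):
    """Half-turn about `center`: z -> 2*center - z, stored as (-1, 2*center)."""
    return (-1, 2 * center)

def product(*maps):
    """Left-to-right product: the first factor is applied first."""
    s, t = 1, 0
    for s2, t2 in maps:
        s, t = s2 * s, s2 * t + t2
    return (s, t)

H = half_turn(0)                    # half-turn about the lattice point 1
X = translation(1)                  # sample generating translations
Y = translation(0.5 + 1.5j)

HX, HY = product(H, X), product(H, Y)
HXY = product(H, X, Y)

# HX is the half-turn about the midpoint of the segment joining 1 and X.
assert HX == half_turn(0.5)
# The product of the three half-turns HX, H, HY is the fourth one, HXY.
assert product(HX, H, HY) == HXY
# A translation T is the product of H and HT.
assert product(H, HY) == Y
```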

It is remarkable that any triangle or any simple quadrangle (not necessarily
convex) will serve as a fundamental region for p2. Half-turns about
the midpoints of the three or four sides may be identified with HX, H, HY
(Figure 4.2b), or HX, H, HY, HXY (Figure 4.2c).


Figure 4.2b 


EXERCISES 


1. Why do the vertices of the quadrangles in Figure 4.2c form two superposed lat- 
tices? 

2. Draw the tessellation of Dirichlet regions for a given lattice. Divide each region
into two halves by means of a diagonal. The resulting tessellation is a special case of the
tessellation of scalene triangles (Figure 4.2b) or of irregular quadrangles (Figure 4.2c)
according as the Dirichlet region is rectangular or hexagonal.


Plate I


4.3 THE ART OF M. C. ESCHER


The groups p1 and p2 are two of the simplest of the seventeen discrete 
groups of isometries involving two independent translations. Several others 
will be mentioned in this section and the next. Convenient generators for all 
of them are listed in Table I on p. 413. 

The art of filling a plane with a repeating pattern reached its highest development
in thirteenth-century Spain, where the Moors used all the seventeen
groups in their intricate decoration of the Alhambra [Jones 1]. Their
preference for abstract patterns was due to their strict observance of the 
Second Commandment. The Dutch artist M. C. Escher, free from such 
scruples, makes an ingenious application of these groups by using animal 
shapes for their fundamental regions. For instance, the symmetry group of 
his pattern of knights on horseback (Plate I) seems at first sight to be p1, 
generated by a horizontal translation and a vertical translation. But by 
ignoring the distinction between the dark and light specimens we obtain the 
more interesting group pg, which is generated by two parallel glide reflections,
say G and G’. We observe that the vertical translation can be expressed
equally well as $G^2$ or $G'^2$. It is remarkable that the single relation


$G^2 = G'^2$


provides a complete abstract definition for this group [Coxeter and Moser
1, p. 43]. Clearly, the knight and his steed (of either color) constitute a fundamental
region for pg. But we must combine two such regions, one dark
and one light, in order to obtain a fundamental region for p1.

Similarly, the symmetry group of Escher’s pattern of beetles (Plate II)
seems at first sight to be pm, generated by two vertical reflections and a ver- 
tical translation. But on looking more closely we see that there are both 
dark and light beetles, and that the colors are again interchanged by glide 
reflections. The complete symmetry group cm, whose fundamental region
is the right or left half of a beetle of either color, is generated by any such 
vertical glide reflection along with a vertical reflection. To obtain a funda- 
mental region for the “smaller” group pm, we combine the right half of a 
beetle of either color with the left half of an adjacent beetle of the other 
color. 

A whole beetle (of either color) provides a fundamental region for the 
group p1 (with one of its generating translations oblique) or equally well for 
pg.


EXERCISES 


1. Locate the axes of two glide reflections which generate pg in Plates I and II. 


2. Any two parallelograms whose sides are in the same two directions can together 
be repeated by translations to fill the plane. 


4.4 SIX PATTERNS OF BRICKS 


Figure 4.4a shows how six of the seventeen two-dimensional space groups 
arise as the symmetry groups of familiar patterns of rectangles, which we 
may think of as bricks or tiles. The generators are indicated as follows: a 
broken line denotes a mirror, a “lens” denotes a half-turn, a small square
denotes a quarter-turn (i.e., rotation through 90°), and a “half arrow” de- 
notes a glide reflection. 

In each case, a convenient fundamental region is indicated by shading. 
This region is to some extent arbitrary except in the case of pmm, where it 
is entirely bounded by mirrors. 

The procedure for analysing such a pattern is as follows. We observe that
the symmetry group of a single brick is $D_2$ (of order 4), which has subgroups
$C_2$ and $D_1$. If all the symmetry operations of the brick are also symmetry
operations of the whole pattern, as in cmm and pmm, the fundamental
region is a quarter of the brick, two of the generators are the reflections that
generate $D_2$, and any other generator transforms the original brick into a
neighboring brick. If only the subgroup $C_2$ or $D_1$ belongs to the whole pattern
(the way $C_2$ belongs to p2 or pgg, and $D_1$ to pmg or p4g), the fundamental
region is half a brick, and the generators are not quite so obvious.


Plate II


Figure 4.4a


EXERCISE 


In all these patterns it is understood that a “brick” is a rectangle in which one side is 
twice as long as another. In each case, any brick is related to the whole pattern in the 
same way as any other. (In technical language, the symmetry group is transitive on the 
bricks.) Are these six the only transitive patterns of bricks?


4.5 THE CRYSTALLOGRAPHIC RESTRICTION 


A mathematician, like a painter or a poet, is a maker of patterns. If his 
patterns are more permanent than theirs, it is because they are made 
with ideas. 


G.H. Hardy [2, p. 24] 


A complete account of the enumeration of the seventeen two-dimensional 
space-groups would occupy too much space. But it seems worthwhile to give 
Barlow’s elegant proof * that the only possible cyclic subgroups are $C_2$, $C_3$,
$C_4$, and $C_6$. In other words:

The only possible periods for a rotational symmetry operation of a lattice are 
2, 3, 4, 6. 

Let P be any center of rotation of period n. The remaining symmetry
operations of the lattice transform P into infinitely many other centers of
rotation of the same period. Let Q be one of these other centers (Figure
4.5a) at the least possible distance from P. A third center, P’, is derived
from P by rotation through 2π/n about Q; and a fourth, Q’, is derived from
Q by rotation through 2π/n about P’. Of course, the segments PQ, QP’,
P’Q’ are all equal. It may happen that P and Q’ coincide; then n = 6. In
all other cases, since Q was chosen at the least possible distance from P, we
must have PQ’ ≥ PQ; therefore n ≤ 4. (If n = 4, PQP’Q’ is a square. If n
= 5, PQ’ is obviously shorter than PQ. If n > 6, PQ crosses P’Q’, but it is
no longer necessary to use Q’: we already have PP’ < PQ, which is sufficiently
absurd.)


* W. Barlow, Philosophical Magazine (6), 1 (1901), p. 17.


Figure 4.5a 
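
Barlow's argument can be followed numerically. The sketch below is only an illustration: it takes PQ as a unit segment in the complex plane, constructs P’ and Q’ by the two rotations through 2π/n, and keeps those n for which neither PP’ nor PQ’ is a nonzero distance shorter than PQ. The function names and the tolerance `eps` are assumptions of the sketch.

```python
import cmath
import math

def barlow_distances(n):
    """Return (PP', PQ') for a unit segment PQ and rotations through 2*pi/n."""
    rot = cmath.exp(2j * math.pi / n)
    p, q = 1 + 0j, 0j
    p1 = q + (p - q) * rot        # P': P rotated about Q
    q1 = p1 + (q - p1) * rot      # Q': Q rotated about P'
    return abs(p1 - p), abs(q1 - p)

def admissible(n, eps=1e-9):
    """True if the construction does not contradict the minimality of PQ."""
    pp1, pq1 = barlow_distances(n)
    # No center may be nearer to P than Q is, unless it coincides with P.
    return pp1 >= 1 - eps and (pq1 < eps or pq1 >= 1 - eps)

print([n for n in range(2, 13) if admissible(n)])   # [2, 3, 4, 6]
```

In closed form PQ’ = |2 cos(2π/n) - 1| · PQ, which vanishes for n = 6, equals PQ for n = 4, and is a proper fraction of PQ for n = 5.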


EXERCISES 
1. If S and T are rotations through 2π/n about P and Q, what is $T^{-1}ST$?
2. If a discrete group of isometries includes two rotations about distinct centers,
it includes two such rotations having the same period, and therefore also a translation.
If this period is greater than 2, it includes two independent translations.


4.6 REGULAR TESSELLATIONS 


The mathematician's patterns, like the painter's or the poet's, must be 
beautiful; the ideas, like the colours or the words, must fit together in 
a harmonious way. Beauty is the first test: there is no permanent place 
in the world for ugly mathematics. 


G. H. Hardy [2, p. 25] 


It was probably Kepler (1571-1630) who first investigated the possible 
ways of filling the plane with equal regular polygons. We shall find it convenient
to use the Schläfli symbol {p, q} for the tessellation of regular p-gons,
q surrounding each vertex [Schläfli 1, p. 213]. The cases


{6,3}, {4,4}, {3,6} 


are illustrated in Figure 4.6a, where in each case the polygon drawn in heavy 
lines is the vertex figure: the q-gon whose vertices are the midpoints of the q
edges at a vertex. (Since tessellations are somewhat analogous to polyhedra, 
it is natural to use the word edges for the common sides of adjacent polygons, 
and faces for the polygons themselves.) 


{6, 3}     {4, 4}     {3, 6}

Figure 4.6a


For a formal definition, we may say that a tessellation is regular if it has 
regular faces and a regular vertex figure at each vertex. 

The tessellation {6, 3} is often used for tiled floors in bathrooms. It can 
also be seen in any beehive. {4, 4} is familiar in the form of squared paper; 
in terms of Cartesian coordinates, its vertices are just the points for which 
both x and y are integers. {3, 6} is the dual of {6, 3} in the following sense. 
The dual of {p, q} is the tessellation whose edges are the perpendicular bisectors
of the edges of {p, q} (see Figure 4.6b). Thus the dual of {p, q} is
{q, p}, and vice versa; the vertices of either are the centers of the faces of
the other. In particular, the dual of {4, 4} is an equal {4, 4}. 


Figure 4.6b 


The possible values of p and q are easily obtained by equating the angle
of a p-gon, namely (1 − 2/p)π, to the value it must have if q such polygons
come together at a vertex:


$\left(1 - \frac{2}{p}\right)\pi = \frac{2\pi}{q}, \qquad \frac{1}{p} + \frac{1}{q} = \frac{1}{2}, \qquad (p - 2)(q - 2) = 4.$

The three possible ways of factorizing 4, namely

$4 \cdot 1, \quad 2 \cdot 2, \quad 1 \cdot 4,$

yield the three tessellations already described. However, before declaring
that these are the only regular tessellations, we should investigate the fractional
solutions of our equation; for there might conceivably be a regular
“star” tessellation {p, q} whose face {p} and vertex figure {q} are regular
polygons of the kind considered in §2.8. For instance, Figure 4.6c shows
ten pentagons placed together at a common vertex. Although they overlap,
we might expect to be able to add further pentagons so as to form a tessellation
{5, 10/3} (whose vertex figure is a decagram), covering the plane a
number of times. But in fact this number is infinite, as we shall see.
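
Both the integral and the fractional solutions of 1/p + 1/q = 1/2 can be listed with exact rational arithmetic. The sketch below is illustrative; the helper name `q_for` and the search bound 50 are arbitrary choices.

```python
from fractions import Fraction

def q_for(p):
    """Solve 1/p + 1/q = 1/2 for q, given p > 2 (exact rational arithmetic)."""
    p = Fraction(p)
    return 2 * p / (p - 2)

# Integral solutions: q = 2 + 4/(p - 2) is an integer only when p - 2 divides 4.
integer_solutions = [(p, int(q_for(p))) for p in range(3, 50)
                     if q_for(p).denominator == 1]
print(integer_solutions)     # [(3, 6), (4, 4), (6, 3)]

# A fractional solution: pentagons would need the vertex figure {10/3}.
print(q_for(5))              # 10/3
```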

Consider the general regular tessellation {p, q}, where p = n/d. If it covers
the plane only a finite number of times, there must be a minimum distance
between the centers of pairs of faces. Let P, Q be two such centers
at this minimum distance apart. Since they are centers of rotation of period
n, the argument used in §4.5 proves that the only possible values of n are 3, 4,
6. Thus d = 1, and these are also the only possible values of p. Hence there
are no regular star tessellations [Coxeter 1, p. 112].

It is actually possible to cover a sphere three times by using twelve “penta- 
gons” whose sides are arcs of great circles [Coxeter 1, p. 111]. 


Figure 4.6c 


To find the symmetry group of a regular tessellation, we treat its face the 
way we treated one of the bricks in §4.4. Clearly, the symmetry group of 
{p, q} is derived from the symmetry group $D_p$ of one face by adding the reflection
in a side of that face. Thus it is generated by reflections in the sides
of a triangle whose angles are π/p (at the center of the face), π/2 (at the midpoint
of an edge), and π/q (at a vertex). This triangle is a fundamental
region, since it is transformed into neighboring triangles by the three gen- 
erating reflections. Since each generator leaves invariant all the points on 
one side, the fundamental region is unique: it cannot be modified by addi- 
tion and subtraction the way Escher modified the fundamental regions of 
some other groups. 

The network of such triangles, filling the plane, is cut out by all the lines 
of symmetry of the regular tessellation. The lines of symmetry include the 
lines of the edges of both {p, q} and its dual {q, p}. In the case of {6, 3}
and {3, 6} (Figure 4.6b), these edge lines suffice; in the case of the two dual
{4, 4}’s we need also the diagonals of the squares. In Figure 4.6d, alternate 
regions have been shaded so as to exhibit both the complete symmetry 
groups p6m, p4m and the “direct” subgroups p6, p4 (consisting of rota- 
tions and translations) which preserve the colors and the direction of the 
shading [Brewster 1, p. 94; Burnside 1, pp. 416, 417]. 

Instead of deriving the network of triangles from the regular tessellation, 
we may conversely derive the tessellation from the network. For this purpose,
we pick out a point in the network where the angles are π/p, that is,
where p shaded and p white triangles come together. These 2p triangles combine
to form a face of {p, q}.




p6 and p6m 


Figure 4.6d 


EXERCISES 


1. Justify the formal definition of “regular” on page 62. (It implies that the faces
are all alike and that the vertices are all surrounded alike.) 

2. Give a general argument to prove that the midpoints of the edges of a regular 
tessellation belong to a lattice. (Hint: Consider the group p2 generated by half-turns 
about three such midpoints.) 

3. Pick out the midpoints of the edges of {6,3}. Verify that they belong to a lat- 
tice. Do they constitute the whole lattice? 

4. Draw portions of lattices whose symmetry groups are p2, pmm, cmm, p4m,
p6m.


4.7 SYLVESTER’S PROBLEM OF COLLINEAR POINTS


Reductio ad absurdum, which Euclid loved so much, is one of a mathematician’s
finest weapons. It is a far finer gambit than any chess gambit:
a chess player may offer the sacrifice of a pawn or even a piece,
but a mathematician offers the game.


G.H. Hardy [2, p. 34] 


As we saw in § 4.1, a lattice is a discrete set of points having the property
that the line joining any two of them contains not only these two but infi- 
nitely many. Figure 4.7a shows a finite “orchard” in which nine points are 
arranged in ten rows of three [Ball 1, p. 105]. It was probably the investigation
of such configurations that led Sylvester* to propose his problem of
1893:


Prove that it is not possible to arrange any finite number of real points so that a right
line through every two of them shall pass through a third, unless they all lie in the same
right line.


Figure 4.7a Figure 4.7b 


Neither Sylvester nor any of his contemporaries were able to think of a 
satisfactory proof. The question was forgotten till 1933, when Karamata 
and Erdős revived it, and T. Gallai (alias Grünwald) finally succeeded,
using a rather complicated argument. Sylvester’s “negative” statement was 
rephrased “positively” by Motzkin: 


If n points in the real plane are not on one straight line, then there exists a straight line 
containing exactly two of the points. 


The following proof, which somewhat resembles Barlow’s proof of the 
crystallographic restriction (§ 4.5), is due to L. M. Kelly. 


* J. J. Sylvester, Mathematical Questions and Solutions from the Educational Times, 59 (1893),
p. 98 (Question 11851). See also R. Steinberg, American Mathematical Monthly, 51 (1944), p.
170; L. M. Kelly, ibid., 55 (1948), p. 28; T. Motzkin, Transactions of the American Mathematical
Society, 70 (1951), p. 452; L. M. Kelly and W. O. J. Moser, Canadian Journal of Mathematics,
10 (1958), p. 213.

The n points $P_1, \ldots, P_n$ are joined by at most $\frac{1}{2}n(n-1)$ lines $P_1P_2$, $P_1P_3$,
etc. Consider the pairs $P_i$, $P_jP_k$, consisting of a point and a joining line
which are not incident. Since there are at most $\frac{1}{2}n(n-1)(n-2)$ such
pairs, there must be at least one, say $P_1$, $P_2P_3$, for which the distance $P_1Q$
from the point to the line is the smallest such distance that occurs.

Then the line $P_2P_3$ contains no other point of the set. For if it contained
$P_4$, at least two of the points $P_2$, $P_3$, $P_4$ would lie on one side of the perpendicular
$P_1Q$ (or possibly one of the P’s would coincide with Q). Let the
points be so named that these two are $P_2$, $P_3$, with $P_2$ nearer to Q (or coincident
with Q). Then $P_2$, $P_3P_1$ (Figure 4.7b) is another pair having a
smaller distance than $P_1Q$, which is absurd.


This completes the proof that there is always a line containing exactly 
two of the points. Of course, there may be more than one such line; in fact, 
Kelly and Moser proved that the number of such lines is at least 3n/7. 
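
Kelly's argument can be tried numerically. The following Python sketch is only an illustration: the function name `ordinary_line` and the tolerance `eps` are our own assumptions, not part of the text. It searches all non-incident pairs (point, joining line) for one of smallest distance and returns the two points that determine that line.

```python
from itertools import combinations


def ordinary_line(points, eps=1e-9):
    """Return indices (i, j) of two points whose join contains no third point.

    Idea of Kelly's proof: among all non-incident (point, line) pairs,
    a pair with the smallest distance has a line through exactly two points.
    The points are assumed to be distinct.
    """
    best = None  # (distance, i, j)
    for i, j in combinations(range(len(points)), 2):
        (x1, y1), (x2, y2) = points[i], points[j]
        length = ((x2 - x1) ** 2 + (y2 - y1) ** 2) ** 0.5
        for k, (x, y) in enumerate(points):
            if k in (i, j):
                continue
            # Distance from points[k] to the line through points[i], points[j].
            d = abs((x2 - x1) * (y - y1) - (y2 - y1) * (x - x1)) / length
            if d > eps and (best is None or d < best[0]):
                best = (d, i, j)
    if best is None:
        raise ValueError("all the points lie on one line")
    return best[1], best[2]
```

For instance, with the five points (0, 0), (1, 0), (2, 0), (0, 1), (0, 2), the function returns two points whose join misses the other three.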


EXERCISES 
1. The above proof yields a line P2P3 containing only these two of the P’s. The
point Q actually lies between P2 and P3.
2. If n points are not all on one line, they have at least n distinct joins [Coxeter 2,
p. 31].
3. Draw a configuration of n points for which the lower limit of 3n/7 “ordinary” 
joins is attained. (Hint: n = 7.) 


5


Similarity in the Euclidean plane 


In later chapters we shall see that Euclidean geometry is by no means the 
only possible geometry: other kinds are just as logical, almost as useful, and 
in some respects simpler. According to the famous Erlangen program 
(Klein’s inaugural address at the University of Erlangen in 1872), the cri- 
terion that distinguishes one geometry from another is the group of trans- 
formations under which the propositions remain true. In the case of Eu- 
clidean geometry, we might at first expect this to be the continuous group 
of all isometries. But since the propositions remain valid when the scale 
of measurement is altered, as in a photographic enlargement, the “principal 
group” for Euclidean geometry [Klein 2, p. 133] includes also “similarities” 
(which may change distances although of course they preserve angles). In 
the present chapter we classify such transformations of the Euclidean plane. 
In particular, “dilatations” will be seen to play a useful role in the theory 
of the nine-point center of a triangle. These and other “direct” similarities 
are treated in the standard textbooks, but “opposite” similarities (§ 5.6) seem 
to have been sadly neglected. 


5.1 DILATATION 


“If I eat one of these cakes,” she thought, “it’s sure to make some
change in my size.” ... So she swallowed one ... and was delighted
to find that she began shrinking directly.


Lewis Carroll 
[Dodgson 1, Chap. 4] 


It is convenient to extend the usual definition of parallel by declaring that
two (infinite straight) lines are parallel if they have either no common point
or two common points. (In the latter case they coincide.) This conven-
tion enables us to assert that, without any exception,

5.11 For each point A and line r, there is just one line through A parallel
to r.


Two figures are said to be homothetic if they are similar and similarly 
placed, that is, if they are related by a dilatation (or “homothecy”), which 
may be defined as follows [Artin 1, p. 54]: 

A dilatation is a transformation which preserves (or reverses) direction: 
that is, it transforms each line into a parallel line. 




Figure 5.1a 


5.12 Two given parallel line segments AB, A’B’ are related by a unique
dilatation AB → A’B’.


For, any point P not on AB is transformed into the point P’ in which the 
line through A’ parallel to AP meets the line through B’ parallel to BP (Fig- 
ure 5.1a); and any point Q on AB is transformed into the point Q’ in which 
A’B’ meets the line through P’ parallel to PQ. 

In other words, a dilatation is completely determined by its effect on any 
two given points [Coxeter 2, 8.51]. 

Clearly, the inverse of the dilatation AB → A’B’ is the dilatation
A’B’ → AB. Also AB → AB is the identity, AB → BA is a half-turn (about
the midpoint of AB), and if ABB’A’ is a parallelogram, AB → A’B’ is a trans-
lation.


Figure 5.1b 


For any dilatation which is not the identity, the two points A and B may
be so chosen that A is not an invariant point and AB is not an invariant
line. Such a dilatation AB → A’B’ (Figure 5.1b) transforms any point P
on AA’ into a point P’ on the parallel line through A’, which is AA’ itself.
Similarly, it transforms any point Q on BB’ into a point Q’ on BB’. If AA’
and BB’ are not parallel, these two invariant lines intersect in an invariant
point O. Hence


5.13 Any dilatation that is not a translation has an invariant point. 


This invariant point O is unique. For, a dilatation that has two invariant
points O1 and O2 can only be the identity, which may reasonably be re-
garded as a translation, namely a translation through distance zero [Weyl
1, p. 69].

Clearly, any point P is transformed into a point P’ on OP. Let us write

OP’ = λ OP,

with the convention that the number λ is positive or negative according as
P and P’ are on the same side of O or on opposite sides. With the help of
some homothetic triangles (as in Figure 5.1b), we see that λ is a constant,
that is, independent of the position of P. Moreover, any segment PQ is
transformed into a segment |λ| times as long, and oppositely directed if
λ < 0. We shall use the symbol O(λ) for the dilatation with center O and
ratio λ. (Court [2, p. 40] prefers “(O, λ)”.)

In particular, O(1) is the identity and O(−1) is a half-turn. Clearly, the
only dilatations which are also isometries are half-turns and translations.
In the case of a translation, such a symbol as O(λ) is no longer available.
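
As an informal illustration of the symbol O(λ), here is a minimal Python sketch. The function name `dilatation` and the representation of points as coordinate pairs are our own assumptions. It applies the rule OP’ = λ OP to the vector from the center to the point.

```python
def dilatation(center, ratio):
    """Return the map P -> P' for which OP' = ratio * OP (as vectors from O)."""
    ox, oy = center

    def apply(p):
        x, y = p
        return (ox + ratio * (x - ox), oy + ratio * (y - oy))

    return apply


half_turn = dilatation((1, 2), -1)   # O(-1) is a half-turn about O
triple = dilatation((1, 1), 3)       # O(3) with center (1, 1)
```

Here `half_turn((3, 5))` gives (-1, -1), the reflection of (3, 5) in the point (1, 2), and `triple` leaves its center (1, 1) fixed while moving (2, 3) to (4, 7).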


Figure 5.1c


EXERCISES 


1. What is the inverse of the dilatation O(λ)?

2. If the product of O1(λ1) and O2(λ2) is O(λ1λ2), where is O?

3. Express the dilatation O(λ) in terms of (a) polar coordinates, (b) Cartesian co-
ordinates.

4. Explain the action of the pantograph (Figure 5.1c), an instrument invented by
Christoph Scheiner about 1630 for the purpose of making a copy, reduced or enlarged,
of any given figure. It is formed by four rods, hinged at the corners of a parallelo-
gram AA’BC whose angles are allowed to vary. The three collinear points O, P, P’,
on the respective rods AA’, AC, A’B, remain collinear when the shape of the parallelo-
gram is changed. The instrument is pivoted at O. When a pencil point is inserted at
P’ and a tracing point at P (or vice versa), and the latter is traced over the lines of a
given figure, the pencil point draws a homothetic copy. The positions of O and P are
adjustable on their respective rods so as to allow various choices of the ratio OA : OA’.
(Care must, of course, be taken to keep O and P collinear with P’.)

5. How could the pantograph be modified so as to yield a dilatation O(λ) with λ
negative?


5.2 CENTERS OF SIMILITUDE 


I have often wondered why “similitude” ever got into elementary
geometry.... I’m sure youngsters would be much more at ease with
a pair of circles if they just had centers of “similarity” instead of being
made to imagine that some new idea was insinuating itself.


E. H. Neville (1889-1961)


Figure 5.2a 


A dilatation O(λ), transforming C into C’, transforms a circle with center
C and radius r into a circle with center C’ and radius |λ|r. Conversely,
as we see in Figure 5.2a, if two circles have distinct centers C, C’ and un-
equal radii a, a’, they are related by two dilatations, O(a’/a) and O1(−a’/a),
whose centers O and O1 divide the segment CC’ externally and internally
in the ratio a : a’ [Court 2, p. 184]. These points O and O1 are called the
centers of similitude of the two circles. To construct them, we draw an
arbitrary diameter PCP1 of the first circle and a parallel radius C’P’ of the
second (with P’ on the same side of CC’ as P); then O lies on PP’, and O1
on P1P’.
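
The two centers of similitude can also be found by coordinates. The sketch below is an illustration under our own naming (`centers_of_similitude` is not from the text): it solves C’ − O = (a’/a)(C − O) for the external center and C’ − O1 = −(a’/a)(C − O1) for the internal one.

```python
def centers_of_similitude(c, a, c2, a2):
    """External and internal centers of similitude of two circles.

    c, c2 are the centers (coordinate pairs); a, a2 are the unequal radii.
    The external center divides the segment of centers externally in the
    ratio a : a2, the internal center divides it internally in that ratio.
    """
    if a == a2:
        raise ValueError("equal circles have no external center of similitude")
    external = tuple((a * q - a2 * p) / (a - a2) for p, q in zip(c, c2))
    internal = tuple((a * q + a2 * p) / (a + a2) for p, q in zip(c, c2))
    return external, internal
```

For circles of radii 1 and 2 centered at (0, 0) and (6, 0), the centers come out as (−6, 0) and (2, 0), whose distances from the two circle centers are 6 : 12 and 2 : 4 respectively.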

If two circles are concentric or equal, they are still related by two dilata-
tions, but there is only one center of similitude. In the case of concentric
circles this is because the two dilatations have the same center. In the case
of equal circles it is because one of the dilatations is a translation, which
has no center. (The other is the half-turn about O1, which is now the mid-
point of CC’.)


A. Vandeghen and G. R. Veldkamp (American Mathematical Monthly, 71 (1964), p.
178) found that, for the triangle considered in Exercise 10 of § 1.5 (page 16), the centers
of similitude of the two “Soddy circles” are the incenter and the Gergonne point: the
point of concurrence of the lines joining the vertices to the points of contact of the re-
spectively opposite sides with the incircle.


EXERCISES 


1. If two equal circles have no common point, they have two parallel common tan-
gents and two other common tangents through O1 (midway between the centers). If
they touch they have only three common tangents. If they intersect they have only
the two parallel common tangents.

2. Any common tangent of two unequal circles passes through a center of simili- 
tude. Sketch the positions of the centers of similitude, and record the number of com- 
mon tangents, in the five essentially different instances of two such circles. (Two of 
the five are shown in Figure 5.2a.) 

3. Given two dilatations O(λ), O1(λ1), with λ ≠ λ1, describe the position of the
unique point C on which both have the same effect.


5.3 THE NINE-POINT CENTER 


Consider an arbitrary triangle ABC, with circumcenter O, centroid G, and
orthocenter H. Let A’, B’, C’ be the midpoints of the sides, and A”, B”,
C” the midpoints of the segments HA, HB, HC, as in Figure 1.7a. Clearly,
both the triangles A’B’C’, A”B”C” are homothetic to ABC, being derived
from ABC by the respective dilatations G(−½), H(½). The former provides
a new proof that the medians are concurrent and trisect one another.

Since G(−½) and H(½) are the two dilatations by which the nine-point
circle can be derived from the circumcircle [Court 2, p. 104], the points
G, H are the centers of similitude of these two circles, and the Euler line
GH contains the centers of both circles: not only the circumcenter O, as
we know already, but also the nine-point center N. Since the ratios
of the two dilatations are ±½, the nine-point radius is half the circumradius,
and the centers of similitude H, G divide the segment ON externally and
internally in the ratio 2 : 1 (Figure 5.3a). Thus N is the midpoint of OH.
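
These relations are easy to check in coordinates. The following Python sketch is an illustration with helper names of our own choosing (`circumcenter`, `dilate`, `euler_points`). It finds N independently, as the circumcenter of the midpoint triangle A’B’C’, and then takes H to be the point that H(½) requires, namely the reflection of O in N.

```python
def circumcenter(a, b, c):
    """Center of the circle through three non-collinear points."""
    (ax, ay), (bx, by), (cx, cy) = a, b, c
    d = 2 * (ax * (by - cy) + bx * (cy - ay) + cx * (ay - by))
    sa, sb, sc = ax * ax + ay * ay, bx * bx + by * by, cx * cx + cy * cy
    ux = (sa * (by - cy) + sb * (cy - ay) + sc * (ay - by)) / d
    uy = (sa * (cx - bx) + sb * (ax - cx) + sc * (bx - ax)) / d
    return (ux, uy)


def midpoint(p, q):
    return ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)


def dilate(center, ratio, p):
    """Image of p under the dilatation with the given center and ratio."""
    return (center[0] + ratio * (p[0] - center[0]),
            center[1] + ratio * (p[1] - center[1]))


def euler_points(a, b, c):
    """Return (O, G, N, H) for the triangle abc."""
    o = circumcenter(a, b, c)
    g = ((a[0] + b[0] + c[0]) / 3, (a[1] + b[1] + c[1]) / 3)
    # Nine-point center: circumcenter of the midpoints of the sides.
    n = circumcenter(midpoint(b, c), midpoint(c, a), midpoint(a, b))
    # N is the midpoint of OH, so H is the reflection of O in N.
    h = (2 * n[0] - o[0], 2 * n[1] - o[1])
    return o, g, n, h
```

For the triangle with vertices (0, 0), (8, 0), (2, 6) this gives O = (4, 2), N = (3, 2), H = (2, 2); one can confirm that AH is perpendicular to BC and that `dilate(g, -0.5, o)` agrees with N.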


O G N H 


Figure 5.3a 


EXERCISES 
1. Using Cartesian coordinates, find the ordinates y of the centers O, G, N, H of
the isosceles triangle whose vertices are (0, 10), (±6, −8).


2. If ABCH is an orthocentric quadrangle (see 1.72), the four Euler lines of the tri-
angles BCH, CAH, ABH, ABC are concurrent.


5.4 THE INVARIANT POINT OF A SIMILARITY


When a figure is enlarged so as to remain still of the same shape,
every straight line in it remains a straight line, and every angle re-
mains congruent to itself. All the parts of the figure are equally en-
larged. When one figure is an enlarged copy of another, the two
are said to be similar. The degree of enlargement necessary to make
one figure equal to the other is called their ratio of similitude. The
ratio of two lines in the one figure is equal to the ratio of the two
corresponding lines in the other.


W. K. Clifford (1845-1879)
(Mathematical Papers, p. 631) 


A similarity (or “similarity transformation,” or “similitude”) is a trans-
formation which takes each segment AB into a segment A’B’ whose length
is given by

A’B’/AB = μ,

where μ is a constant positive number (the same for all segments) called the
ratio of magnification (Clifford’s “ratio of similitude”). It follows that any
triangle is transformed into a similar triangle, and any angle into an equal
(or opposite) angle. When μ = 1, the similarity is an isometry. Other spe-
cial cases are the dilatations O(μ).




Figure 5.4a Figure 5.4b 


A less familiar instance is the dilative rotation (or “spiral similarity”,
Figure 5.4a), which is the product of a dilatation O(μ) and a rotation about
O. Another is the dilative reflection (Figure 5.4b), which is the product of a
dilatation O(μ) and the reflection in a line through O. We would not obtain
anything new (in either case) if we replaced this dilatation O(μ) by O(−μ).
For, since O(−μ) = O(−1) · O(μ), and O(−1) is a half-turn, the product of
O(μ) and a rotation through α about O is the same as the product of O(−μ)
and a rotation through α + π; and since O(−1) is the product of two per-
pendicular reflections, the product of O(μ) and the reflection in a line m
through O is the same as the product of O(−μ) and the reflection in the line
through O perpendicular to m. In fact, a dilative reflection has two per-
pendicular invariant lines (its axes), which are the internal and external bi-
sectors of ∠AOA’ (and of ∠BOB’).

Clearly (cf. 3.11), 


5.41 Any two similar triangles ABC, A’B’C’ are related by a unique simi-
larity ABC → A’B’C’, which is direct or opposite according as the sense of
A’B’C’ agrees or disagrees with that of ABC.


In other words, a similarity is completely determined by its effect on any 
three given non-collinear points. For instance, the two triangles CBF, ACF 
used in proving Pythagoras’s theorem (Figure 1.3c) are related by a dilative
rotation, the product of the dilatation F(AC/CB) and a quarter-turn; and 
the two triangles ABC, ACF (in the same figure) are related by a dilative 
reflection whose axes are the bisectors of the angle A. 

Here is another way of expressing the same idea: 

Any two line segments AB, A’B’, are related by just two similarities: one 
direct and one opposite. 

For instance, the segment AB can be completed to make a square ABCD 
on either side of the line AB, and similarly there are two ways to place a 
square A’B’C’D’ on A’B’. The similarity 

ABCD → A’B’C’D’


is direct or opposite according as the senses round these two squares agree 
or disagree. 


Figure 5.4c


Theorem 5.13 suggests the possibility that every similarity with μ ≠ 1
may have an invariant point.

If a given similarity is not a dilatation, there must be at least one line trans-
formed into a nonparallel line. Let AB and A’B’ be corresponding segments
on such a pair of lines, and let the given similarity (direct or opposite) be
determined by similar, but not congruent, parallelograms ABCD and
A’B’C’D’ (for example, by squares, as above).

Let P, Q, R, S denote the points of intersection of the pairs of correspond-
ing lines AB and A’B’, BC and B’C’, CD and C’D’, DA and D’A’, as in
Figure 5.4c. Suppose the given similarity transforms P (on AB) into P’ (on
A’B’), and R (on CD) into R’ (on C’D’). Let O be the common point of the
lines PR, P’R’ (which cannot be parallel, for, if they were, PRR’P’ would be
a parallelogram, and the segments PR, P’R’ would be congruent, contra-
dicting P’R’ = μPR). Since the point pairs PP’ and RR’ lie on parallel
lines A’B’ and C’D’,

OP/OR = OP’/OR’.

Therefore, the similarity leaves O invariant. Moreover, O is its only invari-
ant point. For, if a similarity with μ ≠ 1 had two invariant points O1 and
O2, the distance O1O2 would be left unchanged instead of being multiplied
by μ. Hence


5.42 Any similarity that is not an isometry has just one invariant point. 
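
Theorem 5.42 can be illustrated with complex numbers, a device not used in the text and adopted here only as an assumption of the sketch: a direct similarity is written z → az + b and an opposite one z → a·conj(z) + b, with |a| = μ. The hypothetical helper `fixed_point` solves for the invariant point in each case.

```python
def fixed_point(a, b, opposite=False):
    """Invariant point of z -> a*z + b (direct) or z -> a*conj(z) + b (opposite).

    abs(a) is the ratio of magnification mu, which must differ from 1.
    """
    a, b = complex(a), complex(b)
    mu = abs(a)
    if abs(mu - 1.0) < 1e-12:
        raise ValueError("mu = 1: an isometry need not have one invariant point")
    if opposite:
        # Substitute the conjugate equation conj(z) = conj(a)*z + conj(b)
        # into z = a*conj(z) + b and solve the resulting linear equation.
        return (a * b.conjugate() + b) / (1 - mu * mu)
    return b / (1 - a)
```

For example, the dilative rotation z → 2iz + (1 + i) fixes the point (−1 + 3i)/5, and the dilative reflection z → 2·conj(z) + (3 + i) fixes −3 + i/3.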


Figure 5.4d 


Moreover, given two similar parallelograms ABCD and A’B’C’D’, we can
use the method indicated in Figure 5.4d to construct the center of the simi-
larity that relates them. For, having seen that O lies on the line PR, we can
apply the same reasoning (using BC and DA instead of AB and CD) to show
that O lies on the line QS. This is a different line, for, if P, Q, R, S were all
collinear we would have

PA/PB = PS/PQ = PA’/PB’   and   RC/RD = RQ/RS = RC’/RD’,

making both P and R invariant. Hence, O can be constructed as the point of
intersection of the lines PR and QS.


EXERCISE


How can the idea of continuity be used for a different proof of Theorem 5.42? 


5.5 DIRECT SIMILARITY 


Consider a given direct similarity whose ratio of magnification μ is not 1.
Since there is an invariant point, say O, this similarity may be expressed as
the product of the dilatation O(μ) and a direct isometry leaving O invariant.
By Theorem 3.14, such an isometry is simply a rotation about O. Hence, 

5.51 Any direct similarity that is not an isometry is a dilatation or a dila- 
tive rotation. 


EXERCISES 


1. What is the product of two dilative rotations? 
2. How can two circles be used to locate the invariant point of the direct similarity 
that relates two given incongruent segments on nonparallel lines? [Casey 1, p. 186.] 


5.6 OPPOSITE SIMILARITY 


Consider a given opposite similarity whose ratio of magnification μ is not
1. Since there is an invariant point, say O, this similarity may be expressed
as the product of the dilatation O(μ) and an opposite isometry leaving O
invariant. By Theorem 3.14, such an isometry is simply the reflection in a 
line through O. Hence 


5.61 Any opposite similarity that is not an isometry is a dilative reflection. 


Figure 5.6a


EXERCISES


1. If two maps of the same country on different scales are drawn on tracing paper
and superposed, there is just one place that is represented by the same spot on both
maps. (It is understood that one of the maps may be turned over before it is super-
posed on the other.) [Lachlan 1, pp. 137, 139.]

2. When all the points P on AB are related by a similarity to all the points P’
on A’B’, the points dividing the segments PP’ in the ratio AB : A’B’ (internally or ex-
ternally) are distinct and collinear or else they all coincide.

3. If S is an opposite similarity, S² is a dilatation.

4. What is the product (a) of two dilative reflections? (b) of a dilative rotation and
a dilative reflection?

5. Let AB and A’B’ be two given segments of different lengths. Let A1 and A2
divide AA’ internally and externally in the ratio AB : A’B’ (as in Figure 5.6a). Let B1
and B2 divide BB’ in the same manner. Then the lines A1B1 and A2B2 are at right
angles, and are the axes of the dilative reflection that transforms AB into A’B’. [Lachlan
1, p. 134; Johnson 1, p. 27.] (It has been tacitly assumed that A1 ≠ B1 and A2 ≠ B2.
However, if A2 and B2 coincide, the axes are A1B1 and the perpendicular line through
A2.)

6. Describe the transformation

(r, θ) → (μr, θ + α)

of polar coordinates, and the transformation

(x, y) → (μx, −μy)

of Cartesian coordinates.


6 


Circles and spheres 


The present chapter shows how Euclidean geometry, in which lines and 
planes play a fundamental role, can be extended to inversive geometry, in 
which this role is taken over by circles and spheres. We shall see how the 
obvious statement, that lines and planes are circles and spheres of infinite 
radius, can be replaced by the sophisticated statement that lines and planes 
are those circles and spheres which pass through an “ideal” point, called 
“the point at infinity.” In § 6.9 we shall briefly discuss a still more unusual 
geometry, called elliptic, which is one of the celebrated “non-Euclidean”’ 
geometries. 


6.1 INVERSION IN A CIRCLE 


Can it be that all the great scientists of the past were really playing 
a game, a game in which the rules are written not by man but by 
God?... When we play, we do not ask why we are playing—we 
just play. Play serves no moral code except that strange code which, 
for some unknown reason, imposes itself on the play.... You will 
search in vain through scientific literature for hints of motivation. And 
as for the strange moral code observed by scientists, what could be 
stranger than an abstract regard for truth in a world which is full of 
concealment, deception, and taboos? ... In submitting to your con- 
sideration the idea that the human mind is at its best when playing, 
I am myself playing, and that makes me feel that what I am saying may
have in it an element of truth.


J. L. Synge (1897- )*


All the transformations so far discussed have been similarities, which
transform straight lines into straight lines and angles into equal angles. The
transformation called inversion, which was invented by L. J. Magnus in
1831, is new in one respect but familiar in another: it transforms some
straight lines into circles, but it still transforms angles into equal angles.
Like the reflection and the half-turn, it is involutory (that is, of period 2).
Like the reflection, it has infinitely many invariant points; these do not lie
on a straight line but on a circle, and the center of the circle is “singular”:
it has no image!

* Hermathena, 19 (1958), p. 40; quoted with the editor’s permission.




Figure 6.1a 


Given a fixed circle with center O and radius k, we define the inverse of 
any point P (distinct from O) to be the point P’ on the ray OP whose dis- 
tance from O satisfies the equation 


OP × OP’ = k².


It follows from this definition that the inverse of P’ is P itself. Moreover, 
every point outside the circle of inversion is transformed into a point in- 
side, and every point inside (except the center O) into a point outside. The 
circle is invariant in the strict sense that every point on it is invariant. Every 
line through O is invariant as a whole, but not point by point. 
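
The definition translates directly into coordinates. The following Python sketch is an illustration only; the function name `invert` and its default arguments are our own assumptions. It moves P along the ray OP to the point P’ with OP × OP’ = k².

```python
def invert(p, center=(0.0, 0.0), k=1.0):
    """Inverse of the point p in the circle with the given center and radius k."""
    dx, dy = p[0] - center[0], p[1] - center[1]
    d2 = dx * dx + dy * dy          # OP squared
    if d2 == 0:
        raise ValueError("the center O has no inverse in the Euclidean plane")
    s = k * k / d2                  # OP' / OP = k^2 / OP^2
    return (center[0] + s * dx, center[1] + s * dy)
```

With k = 2, the outside point (4, 0) goes to the inside point (1, 0) and back again, while a point of the circle itself, such as (0, 2), stays where it is.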

To construct the inverse of a given point P (other than O) inside the cir- 
cle of inversion, let T be one end of the chord through P perpendicular to 
OP, as in Figure 6.1a. Then the tangent at T meets OP (extended) in the
desired point P’. For, since the right-angled triangles OPT, OTP’ are simi-
lar, and OT = k,

OP/k = k/OP’.


To construct the inverse of a given point P’ outside the circle of inver- 
sion, let T be one of the points of intersection of this circle with the circle 
on OP’ as diameter (Figure 6.1a). Then the desired point P is the foot of 
the perpendicular from T to OP’. 

If OP > ½k, the inverse of P can easily be constructed by the use of com-
passes alone, without a ruler. To do so, let the circle through O with cen-
ter P cut the circle of inversion in Q and Q’. Then P’ is the second inter-
section of the circles through O with centers Q and Q’. (This is easily seen
by considering the similar isosceles triangles POQ, QOP’.)
There is an interesting connection between inversion and dilatation: 


6.11 The product of inversions in two concentric circles with radii k and
k’ is the dilatation O(μ) where μ = (k’/k)².

To prove this, we observe that this product transforms P into P” (on OP)
where

OP × OP’ = k²,   OP’ × OP” = k’²

and therefore

OP”/OP = (k’/k)².


EXERCISES


1. Using compasses alone, construct the vertices of a regular hexagon. 

2. Using compasses alone, locate a point B so that the segment OB is twice as long 
as a given segment OA. 

3. Using compasses alone, construct the inverse of a point distant ½k from the cen-
ter O of the circle of inversion. Describe a procedure for inverting points arbitrarily
near to O. 

4. Using compasses alone, bisect a given segment. 

5. Using compasses alone, trisect a given segment. Describe a procedure for di- 
viding a segment into any given number of equal parts. 

Note. The above problems belong to the Geometry of Compasses, which was de- 
veloped independently by G. Mohr in Denmark (1672) and L. Mascheroni in Italy 
(1797). For a concise version of the whole story, see Pedoe [1, pp. 23-25] or Courant
and Robbins [1, pp. 145-151]. 


6.2 ORTHOGONAL CIRCLES 


A circle is a happy thing to be—
Think how the joyful perpendicular
Erected at the kiss of tangency
Must meet my central point, my avatar.
And lovely as I am, yet only 3
Points are needed to determine me.


Christopher Morley (1890- )


Two circles are said to be orthogonal if they cut at right angles, that is, if 
they intersect in two points at either of which the radius of each is a tan- 
gent to the other (Figure 6.2a). 

By Euclid III.36 (see p. 8) any circle through a pair of inverse points is
invariant: the circle of inversion decomposes it into two arcs which invert
into each other. Moreover, such a circle is orthogonal to the circle of in-
version, and every circle orthogonal to the circle of inversion is invariant
in this sense. Through a pair of inverse points we can draw a whole pencil
of circles (infinitely many), and they are all orthogonal to the circle of in-
version. Hence


6.21 The inverse of a given point P is the second intersection of any two 
circles through P orthogonal to the circle of inversion. 


Figure 6.2a 


The above remarks provide a simple solution for the problem of drawing, 
through a given point P, a circle (or line) orthogonal to two given circles. 
Let P1, P2 be the inverses of P in the two circles. Then the circle PP1P2
(or the line through these three points, if they happen to be collinear) is 
orthogonal to the two given circles. 
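
The construction just described can be sketched in Python as follows. The function names (`invert`, `circle_through`, `orthogonal_circle`) are hypothetical, and the sketch assumes that P lies on neither given circle, so that P, P1, P2 are distinct.

```python
def invert(p, center, k):
    """Inverse of p in the circle with the given center and radius k."""
    dx, dy = p[0] - center[0], p[1] - center[1]
    s = k * k / (dx * dx + dy * dy)
    return (center[0] + s * dx, center[1] + s * dy)


def circle_through(a, b, c):
    """(center, radius) of the circle through three points, or None if collinear."""
    (ax, ay), (bx, by), (cx, cy) = a, b, c
    d = 2 * (ax * (by - cy) + bx * (cy - ay) + cx * (ay - by))
    if abs(d) < 1e-12:
        return None  # the "circle" is the straight line through the points
    sa, sb, sc = ax * ax + ay * ay, bx * bx + by * by, cx * cx + cy * cy
    ux = (sa * (by - cy) + sb * (cy - ay) + sc * (ay - by)) / d
    uy = (sa * (cx - bx) + sb * (ax - cx) + sc * (bx - ax)) / d
    return (ux, uy), ((ux - ax) ** 2 + (uy - ay) ** 2) ** 0.5


def orthogonal_circle(p, c1, k1, c2, k2):
    """Circle through p orthogonal to the circles (c1, k1) and (c2, k2)."""
    p1 = invert(p, c1, k1)
    p2 = invert(p, c2, k2)
    return circle_through(p, p1, p2)
```

Orthogonality of the result to each given circle can be checked by the Pythagorean relation: the squared distance between the centers equals the sum of the squared radii.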

If O and C are the centers of two orthogonal circles ω and γ, as in Figure
6.2a, the circle on OC as diameter passes through the points of intersection
T, U. Every other point on this circle is inside one of the two orthogonal
circles and outside the other. It follows that, if a and b are two perpen-
dicular lines through O and C respectively, either a touches γ and b touches
ω, or a cuts γ and b lies outside ω, or a lies outside γ and b cuts ω.


6.3 INVERSION OF LINES AND CIRCLES 


Figure 6.3a


We have seen that lines through O invert into themselves. What about
other lines? Let A be the foot of the perpendicular from O to a line not
through O. Let A’ be the inverse of A, and P’ the inverse of any other point
P on the line. (See Figure 6.3a where, for simplicity, the circle of inversion
has not been drawn.) Since


OP x OP’ = k? = OA x OA’, 


the triangles OAP, OP’A’ are similar, and the line AP inverts into the circle 
on OA’ as diameter, which is the locus of points P’ from which OA’ sub-
tends a right angle. Thus any line not through O inverts into a circle through
O, and vice versa. 
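
A short numerical check of this statement, in Python, with O placed at the origin (an assumption of the sketch, as are the names `invert` and `inverse_of_line`): the inverse of the line with foot A is the circle on OA’ as diameter.

```python
def invert(p, k):
    """Inverse of p in the circle with center (0, 0) and radius k."""
    d2 = p[0] ** 2 + p[1] ** 2
    return (k * k * p[0] / d2, k * k * p[1] / d2)


def inverse_of_line(foot, k):
    """Circle (center, radius) into which a line not through O inverts.

    `foot` is A, the foot of the perpendicular from O = (0, 0) to the line.
    """
    a_inv = invert(foot, k)                    # A', the far end of the diameter OA'
    center = (a_inv[0] / 2, a_inv[1] / 2)      # midpoint of OA'
    radius = (center[0] ** 2 + center[1] ** 2) ** 0.5
    return center, radius
```

For the line x = 2 and k = 2, this gives the circle with center (1, 0) and radius 1; the inverse of any point (2, t) of the line lies at distance 1 from (1, 0).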


Figure 6.3b 


Finally, what about a circle not through O? Let P be any point on such
a circle, with center C, and let OP meet the circle again in Q. By Euclid
III.35 again, the product

p = OP × OQ

is independent of the position of P on the circle. Following Jacob Steiner
(1796-1863), we call this product the power of O with respect to the circle.
It is positive when O is outside the circle, zero when O lies on the circle,
and we naturally regard it as being negative when O is inside (so that OP
and OQ are measured in opposite directions). Let the dilatation O(k²/p)
transform the given circle and its radius CQ into another circle (or possibly
the same) and its parallel radius DP’ (Figure 6.3b, cf. Figure 5.2a), so that

OP’/OQ = OD/OC = k²/p.

Since OP × OQ = p, we have, by multiplication,

OP × OP’ = k².

Thus P’ is the inverse of P, and the circle with center D is the desired in-
verse of the given circle with center C. (The point D is usually not the in-
verse of C.)
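
The argument gives explicit formulas: the inverse circle has center D with OD = (k²/p) OC and radius k²r/|p|. The Python sketch below illustrates this with O at the origin; the function name `invert_circle` is our own, and the power is computed as p = OC² − r², which agrees with OP × OQ.

```python
def invert_circle(c, r, k):
    """Inverse of the circle (center c, radius r) in the circle (O, k), O = (0, 0).

    Returns (center, radius) of the inverse circle.
    """
    p = c[0] ** 2 + c[1] ** 2 - r * r    # power of O with respect to the circle
    if p == 0:
        raise ValueError("the circle passes through O; its inverse is a line")
    s = k * k / p                        # ratio of the dilatation O(k^2/p)
    return (s * c[0], s * c[1]), abs(s) * r
```

For the circle with center (3, 0) and radius 1, and k = 2, the power is 8 and the inverse circle has center (1.5, 0) and radius 0.5. The inverse of the center C itself is (4/3, 0), which is indeed not D.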

We have thus proved that the inverse of a circle not through O is another 
circle of the same kind, or possibly the same circle again. The latter possi- 
bility occurs in just two cases: (1) when the given circle is orthogonal to the 
circle of inversion, so that p = k² and the dilatation is the identity; (2) when
the given circle is the circle of inversion itself, so that p = −k² and the dilata-
tion is a half-turn.

When p is positive (see the left half of Figure 6.3b), so that O is outside
the circle with center C, this circle is orthogonal to the circle with center O
and radius √p; that is, the former circle is invariant under inversion with
respect to the latter. In effect, we have expressed the given inversion as
the product of this new inversion, which takes P to Q, and the dilatation
O(k²/p), which takes Q to P’. When p is negative (as in the right half of
Figure 6.3b), P and Q are interchanged by an “anti-inversion”: the product
of an inversion with radius √(−p) and a half-turn [Forder 3, p. 20].

When discussing isometries and other similarities, we distinguished be- 
tween direct and opposite transformations by observing their effect on a tri- 
angle. Since we are concerned only with sense, the triangle could have been 
replaced by its circumcircle. Such a distinction can still be made for inver- 
sions (and products of inversions), which transform circles into circles. In- 
stead of a triangle we use a circle: not an arbitrary circle but a “small” circle 
whose inverse is also “small,” that is, a circle not surrounding O. Referring 
again to the left half of Figure 6.3b, we observe that P and Q describe the 
circle with center C in opposite senses, whereas Q and P’ describe the two 
circles in the same sense. Thus the inverse points P and P’ proceed oppo- 
sitely, and 

Inversion is an opposite transformation. 

It follows that the product of an even number of inversions is direct. One 
instance is familiar: the product of inversions with respect to two concentric 
circles is a dilatation. 


Figure 6.3c Figure 6.3d


EXERCISES 


1. For any two unequal circles that do not intersect, one of the two centers of simili-
tude (§ 5.2) is the center of a circle which inverts either of the given circles into the
other. For two unequal intersecting circles, both centers of similitude have this prop-
erty. What happens in the case of equal intersecting circles?

2. Explain the action of Peaucellier’s cell (Figure 6.3c), an instrument invented by 
A. Peaucellier in 1864 for the purpose of drawing the inverse of any given locus. It is 
formed by four equal rods, hinged at the corners of a rhombus APBP’, and two equal 
(longer) rods connecting two opposite corners, A and B, to a fixed pivot O. When a
pencil point is inserted at P’ and a tracing point at P (or vice versa), and the latter is
traced over the curves of a given figure, the pencil point draws the inverse figure. In
particular, if a seventh rod and another pivot are introduced so as to keep P on a cir-
cle passing through O, the locus of P’ will be a straight line. This linkage gives an
exact solution of the important mechanical problem of converting circular into rec-
tilinear motion. [Lamb 2, p. 314.]

3. Explain the action of Hart’s linkage (Figure 6.3d), an instrument invented by 
H. Hart in 1874 for the same purpose as Peaucellier’s cell. It requires only four rods, 
hinged at the corners of a “crossed parallelogram” ABCD (with AB = CD, BC = DA). 
The three collinear points O, P, P’, on the respective rods AB, AD, BC, remain col- 
linear (on a line parallel to AC and BD) when the shape of the crossed parallelogram 
is changed. As before, the instrument is pivoted at O. [Lamb 2, p. 315.] 

4. With respect to a circle γ of radius r, let p be the power of an outside point O.
Then the circle with center O and radius k inverts γ into a circle of radius k²r/p.


6.4 THE INVERSIVE PLANE 


Whereupon the Plumber said in tones of disgust:
“I suggest that we proceed at once to infinity.”


J. L. Synge [2, p. 131] 


We have seen that the image of a given point P by reflection in a line 
(Figure 1.3b) is the second intersection of any two circles through P orthog-
onal to the mirror, and that the inverse of P in a circle is the second inter-
section of any two circles through P orthogonal to the circle of inversion. 
Because of this analogy, inversion is sometimes called “reflection in a cir- 
cle” [Blaschke 1, p. 47], and we extend the definition of a circle so as to in- 
clude a straight line as a special (or “limiting”) case: a circle of infinite radius. 
We can then say that any three distinct points lie on a unique circle, and 
that any circle inverts into a circle. 

In the same spirit, we extend the Euclidean plane by inventing an “ideal” 
point at infinity O’, which is both a common point and the common center 
of all straight lines, regarded as circles of infinite radius. Two circles with 
a common point either touch each other or intersect again. This remains 
obvious when one of the circles reduces to a straight line. When both of 
them are straight, the lines are either parallel, in which case they touch at 
O’, or intersecting, in which case O’ is their second point of intersection 
[Hilbert and Cohn-Vossen 1, p. 251]. 

We can now assert that every point has an inverse. All the lines through
O, being “circles” orthogonal to the circle of inversion, meet again in O’, the 
inverse of O. When the center O is O’ itself, the “circle” of inversion is 
straight, and the inversion reduces to a reflection. 

The Euclidean plane with O’ added is called the inversive (or “conformal”) 
plane.* It gives inversion its full status as a “transformation” (§ 2.3): a one- 
to-one correspondence without exception. 

Where two curves cross each other, their angle of intersection is naturally 
defined to be the angle between their tangents. In this spirit, two intersect- 
ing circles, being symmetrical by reflection in their line of centers, make 
equal angles at the two points of intersection. This will enable us to prove 


6.41 Any angle inverts into an equal angle (or, more strictly, an opposite 
angle). 

We consider first an angle at a point P which is not on the circle of in- 
version. Since any direction at such a point P may be described as the di- 
rection of a suitable circle through P and its inverse P’, two such directions 
are determined by two such circles. Since these circles are self-inverse, they 
serve to determine the corresponding directions at P’. To show that an 
angle at P is still preserved when P is self-inverse, we use 6.11 to express 
the given inversion as the product of a dilatation and the inversion in a con- 
centric circle that does not pass through P. Since both these transforma- 
tions preserve angles, their product does likewise. 

In particular, right angles invert into right angles, and 


6.42 Orthogonal circles invert into orthogonal circles (including lines as 
special cases). 
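
As a numerical illustration of 6.42 (not part of the original text), the following Python sketch represents points of the plane as complex numbers and inverts two orthogonal circles in the unit circle. The function names and the sample circles are assumptions made for this example, and the helper `invert_circle` handles only circles that do not pass through the center of inversion.

```python
import math


def invert_point(z, center=0j, k=1.0):
    """Inverse of the point z in the circle with the given center and radius k."""
    w = z - center
    if w == 0:
        raise ValueError("the center of inversion has no finite inverse")
    # k^2 * w / |w|^2 equals k^2 / conj(w)
    return center + (k * k) / w.conjugate()


def invert_circle(c, r, center=0j, k=1.0):
    """Center and radius of the inverse of the circle with center c and radius r."""
    d = c - center
    p = abs(d) ** 2 - r ** 2  # power of the center of inversion
    if math.isclose(p, 0.0, abs_tol=1e-12):
        raise ValueError("a circle through the center inverts into a line")
    s = (k * k) / p
    return center + s * d, abs(s) * r


def are_orthogonal(c1, r1, c2, r2):
    """Two circles are orthogonal when d^2 = r1^2 + r2^2."""
    return math.isclose(abs(c1 - c2) ** 2, r1 ** 2 + r2 ** 2, rel_tol=1e-9)


# Two sample orthogonal circles, neither passing through the origin.
alpha = (2 + 0j, 1.0)
beta = (2 + 2j, math.sqrt(3.0))
alpha_inv = invert_circle(*alpha)
beta_inv = invert_circle(*beta)
print(are_orthogonal(*alpha, *beta), are_orthogonal(*alpha_inv, *beta_inv))
```

The radius returned by `invert_circle` is k²r/|p|, in agreement with Exercise 4 of § 6.3.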


Figure 6.4a 


By 6.21, inversion can be defined in terms of orthogonality. Therefore a 
circle and a pair of inverse points invert (in another circle) into a circle and 
a pair of inverse points. More precisely, if a circle γ inverts P into Q and

* M. Bôcher, Bulletin of the American Mathematical Society, 20 (1914), p. 194.
a circle ω inverts γ, P, Q into γ’, P’, Q’, then the circle γ’ inverts P’ into Q’.
An important special case (Figure 6.4a) arises when Q coincides with O,
the center of ω, so that Q’ is O’, the point at infinity. Then P is the inverse
of O in γ, and P’ is the center of γ’. In other words, if γ inverts O into P,
whereas ω inverts γ and P into γ’ and P’, then P’ is the center of γ’.

Two circles either touch, or cut each other twice, or have no common 
point. In the last case (when each circle lies entirely outside the other, or 
else one encloses the other), we may conveniently say that the circles miss 
each other. 

If two circles, α₁ and α₂, are both orthogonal to two circles β₁ and β₂,
we can invert the four circles in a circle whose center is one of the points
of intersection of α₁ and β₁, obtaining two orthogonal circles and two
perpendicular diameters, as in the remark at the end of §6.2. Hence, either
α₁ touches α₂ and β₁ touches β₂, or α₁ cuts α₂ and β₁ misses β₂, or α₁ misses
α₂ and β₁ cuts β₂.


6.5 COAXAL CIRCLES 


In this section we leave the inversive plane and return to the Euclidean 
plane, in order to be able to speak of distances. 


Figure 6.5a 


If P and P’ are inverse points in the circle ω (with center O), as in Figure
6.5a, all the lines through P’ invert into all the circles through O and P: an 
intersecting (or “elliptic”) pencil of coaxal circles, including the straight line 
OPP’ as a degenerate case. The system of concentric circles with center P’, 
consisting of circles orthogonal to these lines, inverts into a nonintersecting
(or “hyperbolic”) pencil of coaxal circles (drawn in broken lines). These
circles all miss one another and are all orthogonal to the intersecting
pencil. One of them degenerates to a (vertical) line, whose inverse is the circle
(with center P’) passing through O.


Figure 6.5b 


As a kind of limiting case when O and P coincide (Figure 6.5b), the cir- 
cles that touch a fixed line at a fixed point O constitute a tangent (or “para- 
bolic”) pencil of coaxal circles. They invert (in a circle with center O) into 
all the lines parallel to the fixed line. Orthogonal to these lines we have 
another system of the same kind, inverting into an orthogonal tangent pen- 
cil of coaxal circles. Again each member of either pencil is orthogonal to 
every member of the other. 

Any two given circles belong to a pencil of coaxal circles of one of these 
three types, consisting of all the circles orthogonal to both of any two circles 
orthogonal to both the given circles. (More concisely, the coaxal circles con- 
sist of all the circles orthogonal to all the circles orthogonal to the given 
circles.) Two circles that cut each other belong to an intersecting pencil 
(and can be inverted into intersecting lines); two circles that touch each 
other belong to a tangent pencil (and can be inverted into parallel lines); 
two circles that miss each other belong to a nonintersecting pencil (by the 
remark at the end of § 6.4). 

Each pencil contains one straight line (a circle of infinite radius) called 
the radical axis (of the pencil, or of any two of its members).* For an inter- 
secting pencil, this is the line joining the two points common to all the cir- 
cles (OP for the “unbroken” circles in Figure 6.5a). For a tangent pencil, 


* Louis Gaultier, Journal de l’École Polytechnique, 16 (1813), p. 147.
it is the common tangent. For a nonintersecting pencil, it is the line midway
between the two limiting points (or circles of zero radius) which are the
common points of the orthogonal intersecting pencil. For each pencil there 
is a line of centers, which is the radical axis of the orthogonal pencil. Hence 


6.51 If tangents can be drawn to the circles of a coaxal pencil from a point
on the radical axis, all these tangents have the same length. 


The radical axis of two given circles may be defined as the locus of points 
of equal power (§ 6.3) with respect to the two circles. This power can be 
measured as the square of a tangent except in the case when the given cir- 
cles intersect in two points O, P, and we are considering a point A on the 
segment OP; then the power is the negative number AO × AP.

It follows that, for three circles whose centers form a triangle, the three 
radical axes (of the circles taken in pairs) concur in a point called the radi- 
cal center, which has the same power with respect to all three circles. If 
this power is positive, its square root is the length of the tangents to any of 
the circles, and the radical center is the center of a circle (of this radius) 
which is orthogonal to all the given circles. But if the power is negative, 
no such orthogonal circle exists. 
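
The radical center can be computed directly, since each radical axis is given by a linear equation. The Python sketch below is an illustration added here, not taken from the text; the representation of a circle as `((x, y), r)` and the three sample circles are assumptions. It solves the two linear equations by Cramer's rule and, when the common power is positive, takes its square root as the radius of the orthogonal circle.

```python
import math


def power(point, circle):
    """Power of a point with respect to a circle ((cx, cy), r)."""
    (x, y), ((cx, cy), r) = point, circle
    return (x - cx) ** 2 + (y - cy) ** 2 - r ** 2


def radical_center(c1, c2, c3):
    """Point of equal power with respect to three circles whose centers form a triangle."""
    (x1, y1), r1 = c1
    (x2, y2), r2 = c2
    (x3, y3), r3 = c3
    # Radical axis of c1 and c2: a1*x + b1*y = d1 (equal powers, quadratic terms cancel).
    a1, b1 = 2 * (x2 - x1), 2 * (y2 - y1)
    d1 = (x2 ** 2 + y2 ** 2 - r2 ** 2) - (x1 ** 2 + y1 ** 2 - r1 ** 2)
    # Radical axis of c1 and c3.
    a2, b2 = 2 * (x3 - x1), 2 * (y3 - y1)
    d2 = (x3 ** 2 + y3 ** 2 - r3 ** 2) - (x1 ** 2 + y1 ** 2 - r1 ** 2)
    det = a1 * b2 - a2 * b1
    if det == 0:
        raise ValueError("the centers are collinear; the radical axes are parallel")
    return (d1 * b2 - d2 * b1) / det, (a1 * d2 - a2 * d1) / det


circles = [((0.0, 0.0), 1.0), ((4.0, 0.0), 2.0), ((0.0, 5.0), 3.0)]
center = radical_center(*circles)
common_power = power(center, circles[0])
# The orthogonal circle exists only when the common power is positive.
orthogonal_radius = math.sqrt(common_power) if common_power > 0 else None
print(center, common_power, orthogonal_radius)
```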


Figure 6.5c


The possibility of inverting any two nonintersecting circles into concentric 
circles (by taking O at either of the limiting points) provides a remarkably 
simple proof for Steiner’s porism:* If we have two (nonconcentric) circles, 
one inside the other, and circles are drawn successively touching them and 
one another, as in Figure 6.5c, it may happen that the ring of touching cir- 
cles closes, that is, that the last touches the first. Steiner’s statement is that, 
if this happens once, it will always happen, whatever be the position of the 
first circle of the ring. To prove this we need only invert the original two 
circles into concentric circles, for which the statement is obvious. 
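
The inversion used in this proof can be carried out numerically. The following Python sketch is an added illustration under stated assumptions: points are complex numbers, the two given circles miss each other, and the names `limiting_points` and `invert_circle` are hypothetical. It locates the limiting points on the line of centers (at the distance from the radical axis whose square is the power of the foot of that axis) and checks that inversion in a unit circle centered at either limiting point makes the two circles concentric.

```python
import math


def limiting_points(c1, r1, c2, r2):
    """Limiting points of the nonintersecting pencil containing two circles that miss each other."""
    d = abs(c2 - c1)
    if d == 0:
        raise ValueError("the circles are already concentric")
    u = (c2 - c1) / d  # unit vector along the line of centers
    m = (d * d + r1 * r1 - r2 * r2) / (2 * d)  # signed distance from c1 to the radical axis
    foot_power = m * m - r1 * r1  # power of the foot of the radical axis
    if foot_power <= 0:
        raise ValueError("the circles touch or cut each other")
    h = math.sqrt(foot_power)
    return c1 + (m - h) * u, c1 + (m + h) * u


def invert_circle(c, r, center, k=1.0):
    """Inverse of a circle (not through `center`) in the circle with that center and radius k."""
    d = c - center
    s = (k * k) / (abs(d) ** 2 - r ** 2)
    return center + s * d, abs(s) * r


# Sample data: a circle of radius 2 inside a nonconcentric circle of radius 5.
outer = (0j, 5.0)
inner = (1 + 0j, 2.0)
L1, L2 = limiting_points(*outer, *inner)
outer_inv = invert_circle(*outer, center=L1)
inner_inv = invert_circle(*inner, center=L1)
print(L1, L2, outer_inv, inner_inv)
```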


* Forder [3, p. 23]. See also Coxeter, Interlocked rings of spheres, Scripta Mathematica, 18 
(1952), pp. 113-121, or Yaglom [2, p. 199]. 

EXERCISES

1. In a pencil of coaxal circles, each member, used as a circle of inversion,
interchanges the remaining members in pairs and inverts each member of the
orthogonal pencil into itself.

2. The two limiting points of a nonintersecting pencil are inverses of each other in 
any member of the pencil. 

3. If two circles have two or four common tangents, their radical axis joins the mid- 
points of these common tangents. If two circles have no common tangent (i.e., if one 
entirely surrounds the other), how can we construct their radical axis? 

4. When a nonintersecting pencil of coaxal circles is inverted into a pencil of con- 
centric circles, what happens to the limiting points? 

5. In Steiner’s porism, the points of contact of successive circles in the ring all lie 
on a circle, and this will serve to invert the two original circles into each other. Do 
the centers of the circles in the ring lie on a circle? 

6. For the triangle considered in Exercise 10 of § 1.5 (page 16), the incircle is coaxal 
with the “two other circles” (Soddy’s circles). 


6.6 THE CIRCLE OF APOLLONIUS 


The analogy between reflection and inversion is reinforced by the follow- 
ing 

PROBLEM. To find the locus of a point P whose distances from two fixed
points A, A’ are in a constant ratio 1 : μ, so that


A’P = μ AP.


Figure 6.6a 


When μ = 1, the locus is evidently the perpendicular bisector of AA’,
that is, the line that reflects A into A’. We shall see that for other values
of μ it is a circle that inverts A into A’. (Apollonius of Perga, c. 260-190 B.C.)

Assuming μ ≠ 1, let P be any point for which A’P = μ AP. Let the internal
and external bisectors of ∠APA’ meet AA’ in A₁ and A₂ (as in Figure 6.6a,
where μ = 4). Take E and F on AP so that A’E is parallel to A₁P and A’F is
parallel to A₂P, that is, perpendicular to A₁P. Since FP = PA’ = PE, we have

\[
\frac{AA_1}{A_1A'} = \frac{AP}{PE} = \frac{AP}{PA'}, \qquad
\frac{AA_2}{A'A_2} = \frac{AP}{FP} = \frac{AP}{PA'}.
\]

(The former result is Euclid VI.3.) Thus A₁ and A₂ divide the segment AA’
internally and externally in the ratio 1 : μ, and their location is independent
of the position of P. Since ∠A₁PA₂ is a right angle, P lies on the circle
with diameter A₁A₂.

Conversely, if A₁ and A₂ are defined by their property of dividing AA’ in
the ratio 1 : μ, and P is any point on the circle with diameter A₁A₂, we have

\[
\frac{AP}{PE} = \frac{AA_1}{A_1A'} = \frac{1}{\mu} = \frac{AA_2}{A'A_2} = \frac{AP}{FP}.
\]

Thus FP = PE, and P, being the midpoint of FE, is the circumcenter of
the right-angled triangle EFA’. Therefore PA’ = PE and

\[
\frac{AP}{PA'} = \frac{AP}{PE} = \frac{1}{\mu}
\]


[Court 2, p. 15].
Finally, the circle of Apollonius A₁A₂P inverts A into A’. For, if O is its
center and k its radius, the distances a = AO and a’ = A’O satisfy

\[
\frac{a - k}{k - a'} = \frac{AA_1}{A_1A'} = \frac{AA_2}{A'A_2} = \frac{a + k}{a' + k},
\]

whence aa’ = k².
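
The construction can be checked numerically. The Python sketch below is an added illustration, not part of the original argument; it assumes points are complex numbers, and the function name `apollonius_circle` and the sample values are hypothetical. It builds the circle on A₁A₂ as diameter, samples points P on it, and computes the ratio A’P/AP together with the product aa’.

```python
import cmath
import math


def apollonius_circle(A, A_prime, mu):
    """Center and radius of the locus A'P = mu * AP, for positive mu other than 1."""
    if mu <= 0 or math.isclose(mu, 1.0):
        raise ValueError("mu must be positive and different from 1")
    A1 = A + (A_prime - A) / (1.0 + mu)  # divides AA' internally in the ratio 1 : mu
    A2 = A + (A_prime - A) / (1.0 - mu)  # divides AA' externally in the ratio 1 : mu
    return (A1 + A2) / 2, abs(A2 - A1) / 2


A, A_prime, mu = 0j, 3 + 0j, 0.5
O, k = apollonius_circle(A, A_prime, mu)

# Sample eight points of the circle and record the ratio A'P / AP at each.
ratios = []
for step in range(8):
    P = O + k * cmath.exp(1j * step * math.pi / 4)
    ratios.append(abs(P - A_prime) / abs(P - A))

a, a_prime = abs(A - O), abs(A_prime - O)
print(O, k, ratios, a * a_prime)
```

With these sample values the circle has center 4 and radius 2, so that a = 4, a’ = 1 and aa’ = k² = 4.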


EXERCISES 


1. When μ varies while A and A’ remain fixed, the circles of Apollonius form a
nonintersecting pencil with A and A’ for limiting points.

2. Given a line l and two points A, A’ (not on l), locate points P on l for which
the ratio A’P/AP is maximum or minimum. (Hint: Consider the circle through A, A’
with its center on l. The problem is due to N. S. Mendelsohn, and the hint to
Richard Blum.)

3. Express k/AA’ in terms of μ.


4. In the notation of Figure 5.6a (which is embodied in Figure 6.6b), the circles on
A₁A₂ and B₁B₂ as diameters meet in two points O and Ō, such that the triangles OAB
and OA’B’ are similar, and likewise the triangles ŌAB and ŌA’B’. Of the two
similarities

OAB → OA’B’ and ŌAB → ŌA’B’,


one is opposite and the other direct. In fact, O is where A₁B₁ meets A₂B₂, and Ō lies
on the four further circles AA’P, BB’P, ABT, A’B’T (cf. Ex. 2 of § 5.5). [Casey 1, p.
185.] If A’ coincides with B, O lies on AB’.

Figure 6.6b


5. Let the inversive distance between two nonintersecting circles be defined as the 
natural logarithm of the ratio of the radii (the larger to the smaller) of two concentric 
circles into which the given circles can be inverted. Then, if a nonintersecting pencil 
of coaxal circles includes α₁, α₂, α₃ (in this order), the three inversive distances satisfy


(α₁, α₂) + (α₂, α₃) = (α₁, α₃).

6. Two given unequal circles are related by infinitely many dilative rotations and 
by infinitely many dilative reflections. The locus of invariant points (in either case) is 
the circle having for diameter the segment joining the two centers of similitude of the 
given circles. (This locus is known as the circle of similitude of the given circles.) What 
is the corresponding result for two given equal circles? 

7. The inverses, in two given circles, of a point on their circle of similitude, are 
images of each other by reflection in the radical axis of the two circles [Court 2, p. 199]. 


6.7 CIRCLE-PRESERVING TRANSFORMATIONS 


Having observed that inversion is a transformation of the whole inversive 
plane (including the point at infinity) into itself, taking circles into circles, we 
naturally ask what is the most general transformation of this kind. We dis- 
tinguish two cases, according as the point at infinity is, or is not, invariant. 

In the former case, not only are circles transformed into circles but also 
lines into lines. With the help of Euclid III.21 (see p. 7) we deduce that 
equality of angles is preserved, and consequently the measurement of angles 
is preserved, so that every triangle is transformed into a similar triangle, and 
the transformation is a similarity (§ 5.4). 

If, on the other hand, the given transformation T takes an ordinary point 
O into the point at infinity O’, we consider the product J₁T, where J₁ is the
inversion in the unit circle with center O. This product J₁T, leaving O’
invariant, is a similarity. Let k² be its ratio of magnification, and Jₖ the
inversion in the circle with center O and radius k. Since, by 6.11, J₁Jₖ is the
dilatation O(k²), the similarity J₁T can be expressed as J₁JₖS, where S is an
isometry. Thus
T = JₖS,
the product of an inversion and an isometry. 
To sum up, 


6.71 Every circle-preserving transformation of the inversive plane is either 
a similarity or the product of an inversion and an isometry. 


It follows that every circle-preserving transformation is the product of at 
most four inversions (provided we regard a reflection as a special kind of in- 
version) [Ford 1, p. 26]. Such a transformation is called a homography or an 
antihomography according as the number of inversions is even or odd. The 
product of two inversions (either of which could be just a reflection) is called 
a rotary or parabolic or dilative homography according as the two inverting
circles are intersecting, tangent, or nonintersecting (i.e., according as the
orthogonal pencil of invariant circles is nonintersecting, tangent, or
intersecting). As special cases we have, respectively, a rotation, a translation,
and a dilatation. The most important kind of rotary homography is
the Möbius involution, which, being the inversive counterpart of a half-turn,
is the product of inversions in two orthogonal circles (e.g., the product
of the inversion in a circle and the reflection in a diameter). Any product of
four inversions that cannot be reduced to a product of two is called a
loxodromic homography [Ford 1, p. 20].


EXERCISE 


When a given circle-preserving transformation is expressed as JS (where J is an inver- 
sion and S an isometry), J and S are unique. There is an equally valid expression SJ’, in 
which the isometry precedes the inversion. Why does this revised product involve the 
same S? Under what circumstances will we have J’ = J?


6.8 INVERSION IN A SPHERE 


By revolving Figures 6.1a, 6.2a, 6.3a, 6.3b, and 6.4a about the line of
centers (OP or OA or OC), we see that the whole theory of inversion ex- 
tends readily from circles in the plane to spheres in space. Given a sphere 
with center O and radius k, we define the inverse of any point P (distinct
from O) to be the point P’ on the ray OP whose distance from O satisfies


OP × OP’ = k².


Alternatively, P’ is the second intersection of three spheres through P
orthogonal to the sphere of inversion. Every sphere inverts into a sphere,
provided we include, as a sphere of infinite radius, a plane, which is the inverse
of a sphere through O. Thus, inversion is a transformation of inversive (or
“conformal”) space, which is derived from Euclidean space by postulating a
point at infinity, which lies on all planes and lines.
Revolving the circle of Apollonius (Figure 6.6a) about the line AA’, we ob- 
tain the sphere of Apollonius, which may be described as follows: 


6.81 Given two points A, A’ and a positive number μ, let A₁ and A₂ divide
AA’ internally and externally in the ratio 1 : μ. Then the sphere on A₁A₂ as
diameter is the locus of a point P whose distances from A and A’ are in this ratio.


EXERCISES 


1. If a sphere with center O inverts A into A’ and B into B’, the triangles OAB and
OB’A’ are similar.
2. In terms of a = OA and b = OB, we have (in the notation of Ex. 1)

\[
A'B' = \frac{k^2}{ab}\,AB.
\]

3. The “cross ratio” of any four points is preserved by any inversion:

\[
\frac{AB/BD}{AC/CD} = \frac{A'B'/B'D'}{A'C'/C'D'}.
\]


[Casey 1, p. 100.] 

4. Two spheres which touch each other at O invert into parallel planes. 

5. Let α, β, γ be three spheres all touching one another. Let σ₁, σ₂, ... be a
sequence of spheres touching one another successively and all touching α, β, γ. Then
σ₆ touches σ₁, so that we have a ring of six spheres interlocked with the original ring
of three.* (Hint: Invert in a sphere whose center is the point of contact of α and β.)


6.9 THE ELLIPTIC PLANE 


In some unaccountable way, while he [Davidson] moved hither and 
thither in London, his sight moved hither and thither in a manner that 
corresponded, about this distant island. ... When I said that nothing
would alter the fact that the place [Antipodes Island] is eight thousand
miles away, he answered that two points might be a yard away
on a sheet of paper, and yet be brought together by bending the paper
round.


H. G. Wells (1866-1946)
(The Remarkable Case of Davidson's Eyes) 


Let S be the foot of the perpendicular from a point N to a plane σ, as in
Figure 6.9a. A sphere (not drawn) with center N and radius NS inverts the
plane σ into the sphere σ’ on NS as diameter [Johnson 1, p. 108]. We have


* Frederick Soddy, The Hexlet, Nature, 138 (1936), p. 958; 139 (1937), p. 77. 
seen that spheres invert into spheres (or planes); therefore circles, being
intersections of spheres, invert into circles (or lines); in particular, all the
circles in σ invert into circles (great or small) on the sphere σ’, and all the
lines in σ invert into circles through N. Each point P in σ yields a corresponding
point P’ on σ’, namely, the second intersection of the line NP with σ’. Conversely,
each point P’ on σ’, except N, corresponds to the point P in which NP’ meets
σ. The exception can be removed by making σ an inversive plane whose
point at infinity is the inverse of N.

This inversion, which puts the points of the inversive plane into one-to- 
one correspondence with the points of a sphere, is known as stereographic 
projection. It serves as one of the simplest ways to map the geographical 
globe on a plane. Since angles are preserved, small islands are mapped with 
the correct shape, though on various scales according to their latitude. 
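
Stereographic projection, regarded as an inversion, is easy to express in coordinates. The Python sketch below is an added illustration under assumed coordinates: the plane is taken as z = 0, N as (0, 0, 1), so that S is the origin and NS = 1; the function names are hypothetical. The image sphere on NS as diameter then has center (0, 0, 1/2) and radius 1/2.

```python
def stereographic(point_in_plane):
    """Invert the point (x, y, 0) in the unit sphere centered at N = (0, 0, 1)."""
    x, y = point_in_plane
    s = x * x + y * y + 1.0  # squared distance from N to (x, y, 0)
    # P' = N + (P - N) / |P - N|^2
    return (x / s, y / s, 1.0 - 1.0 / s)


def inverse_stereographic(point_on_sphere):
    """Point where the line from N through a point of the sphere (other than N) meets z = 0."""
    X, Y, Z = point_on_sphere
    if Z == 1.0:
        raise ValueError("N corresponds to the point at infinity")
    t = 1.0 / (1.0 - Z)
    return (X * t, Y * t)


def on_image_sphere(Q, tol=1e-12):
    """True when Q lies on the sphere with center (0, 0, 1/2) and radius 1/2."""
    X, Y, Z = Q
    return abs(X * X + Y * Y + (Z - 0.5) ** 2 - 0.25) < tol


P = (3.0, -4.0)
Q = stereographic(P)
print(Q, on_image_sphere(Q), inverse_stereographic(Q))
```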


Figure 6.9a Figure 6.9b 


Another way is by gnomonic (or central) projection, in which the point
from which we project is not N but O, the center of the sphere, as in
Figure 6.9b. Each point P in σ yields a line OP, joining it to O. This diameter
meets the sphere in two antipodal points P₁, P₂, which are both mapped on
the same point P. Each line m in σ yields a plane Om, joining it to O. This
diametral plane meets the sphere in a great circle. Conversely, each great
circle of the sphere, except the “equator” (whose plane is parallel to σ),
corresponds to a line in σ. This time the exception can be removed by adding
to the Euclidean plane σ a line at infinity (representing the equator) with all
its points, called points at infinity, which represent pairs of antipodal points
on the equator. Thus, all the lines parallel to a given line contain the same
point at infinity, but lines in different directions have different points at
infinity, all lying on the same line at infinity. (This idea is due to Kepler and
Desargues.)

When the line at infinity is treated just like any other line, the plane so 
extended is called the projective plane or, more precisely, the real projective 
plane [Coxeter 2]. Two parallel lines meet in a point at infinity, and an 
ordinary line meets the line at infinity in a point at infinity. Hence 


6.91 Any two lines of the projective plane meet in a point.


Instead of taking a section of all the lines and planes through O, we could 
more symmetrically (though more abstractly) declare that, by definition, the 
points and lines of the projective plane are the lines and planes through O. 
The statement 6.91 is no longer surprising; it merely says that any two planes 
through O meet in a line through O. 

Equivalently we could declare that, by definition, the lines of the projective 
plane are the great circles on a sphere, any two of which meet in a pair of 
antipodal points. Then the points of the projective plane are the pairs of 
antipodal points, abstractly identified. This abstract identification was 
vividly described by H. G. Wells in his short story, The Remarkable Case of 
Davidson’s Eyes. (A sudden catastrophe distorted Davidson’s field of vision 
so that he saw everything as it would have appeared from an exactly antip- 
odal position on the earth.) 

When the inversive plane is derived from the sphere by stereographic pro- 
jection, distances are inevitably distorted, but the angle at which two circles 
intersect is preserved. In this sense, the inversive plane has a partial metric: 
angles are measured in the usual way, but distances are never mentioned 
[Graustein 1, pp. 377, 388, 395]. 

On the other hand, gnomonic projection enables us, if we wish, to give the 
projective plane a complete metric. The distance between two points P and
Q in σ (Figure 6.9a) is defined to be the angle POQ (in radian measure), and
the angle between two lines m and n in σ is defined to be the angle between
the planes Om and On. (This agrees with the customary measurement of dis- 
tances and angles on a sphere, as used in spherical trigonometry.) We have 
thus obtained the elliptic plane* or, more precisely, the real projective plane 
with an elliptic metric [Coxeter 3, Chapter VI; E. T. Bell 2, pp. 302-311; 
Bachmann 1, p. 21]. 

Since the points of the elliptic plane are in one-to-two correspondence with
the points of the unit sphere, whose total area is 4π, it follows that the total
area of the elliptic plane (according to the most natural definition of “area”)
is 2π. Likewise, the total length of a line (represented by a “great
semicircle”) is π. The simplification that results from using the elliptic plane
instead of the sphere is well illustrated by the problem of computing the area of
a spherical triangle ABC, whose sides are arcs of three great circles. Figure
6.9c shows these great circles, first in stereographic projection and then in
gnomonic projection. The elliptic plane is decomposed, by the three lines
BC, CA, AB, into four triangular regions. One of them is the given triangle
Δ with angles A, B, C; the other three are marked α, β, γ in Figure 6.9c. (On
the sphere, we have, of course, not only four regions but eight.) The two


* The name “elliptic” is possibly misleading. It does not imply any direct connection with the 
curve called an ellipse, but only a rather far-fetched analogy. A central conic is called an ellipse 
or a hyperbola according as it has no asymptote or two asymptotes. Analogously, a non-Euclid- 
ean plane is said to be elliptic or hyperbolic (Chapter 16) according as each of its lines contains no 
point at infinity or two points at infinity. 
lines CA, AB decompose the plane into two lunes whose areas, being
proportional to the supplementary angles A and π − A, are exactly 2A and
2(π − A). The lune with angle A is made up of the two regions Δ and α.
Hence

Δ + α = 2A.

Similarly Δ + β = 2B and Δ + γ = 2C. Adding these three equations
and subtracting


Δ + α + β + γ = 2π,
we deduce Girard’s “spherical excess” formula
6.92 Δ = A + B + C − π,


which is equally valid for the sphere and the elliptic plane. (A. Girard,
Invention nouvelle en algèbre, Amsterdam, 1629.)
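
Formula 6.92 can be tried on a concrete triangle. The Python sketch below is an added illustration, not part of the original text; it assumes the vertices are given as unit vectors from the center of the unit sphere, and the helper names are hypothetical. The angle at a vertex is computed as the angle between the planes of the two great circles meeting there, using their normal vectors.

```python
import math


def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def dot(u, v):
    return u[0] * v[0] + u[1] * v[1] + u[2] * v[2]


def angle_at(A, B, C):
    """Angle at vertex A of the spherical triangle ABC (vertices are unit vectors)."""
    n1, n2 = cross(A, B), cross(A, C)  # normals to the planes OAB and OAC
    c = dot(n1, n2) / math.sqrt(dot(n1, n1) * dot(n2, n2))
    return math.acos(max(-1.0, min(1.0, c)))


def spherical_excess_area(A, B, C):
    """Area of the triangle by Girard's formula: angle sum minus pi."""
    return angle_at(A, B, C) + angle_at(B, C, A) + angle_at(C, A, B) - math.pi


# The octant triangle has three right angles.
octant = ((1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0))
print(spherical_excess_area(*octant))
```

For the octant triangle all four regions of the elliptic plane are congruent, so each has one quarter of the total area 2π, which agrees with the excess 3(π/2) − π.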


Figure 6.9c


EXERCISES 


1. Two circles in the elliptic plane may have as many as four points of intersection. 
2. The area of a p-gon in the elliptic plane is equal to the excess of its angle sum over 
the angle sum of a p-gon in the Euclidean plane. 


7 


Isometry and similarity 
in Euclidean space 


This chapter is the three-dimensional counterpart of Chapters 3 and 5. 
In §7.5 we find a proof (independent of Euclid’s Fifth Postulate) for the 
theorem (discovered by Michel Chasles in 1830) that every motion is a 
twist. In §7.6 we see that every similarity (except the twist and the glide 
reflection, which are isometries) is a three-dimensional dilative rotation. 

Most isometries are familiar in everyday life. When you walk straight for- 
ward you are undergoing a translation. When you turn a corner, it is a rota- 
tion; when you ascend a spiral staircase, a twist. The transformation that 
interchanges yourself and your image in an ordinary mirror is a reflection, 
and it is easy to see how you could combine this with a rotation or a trans- 
lation to obtain a rotatory reflection or a glide reflection, respectively. 


7.1 DIRECT AND OPPOSITE ISOMETRIES 


A congruence is either proper, carrying a left screw into a left and 
a right one into a right, or it is improper or reflexive, changing a left 
screw into a right one and vice versa. The proper congruences are 
those transformations which . . . connect the positions of points of a 
rigid body before and after a motion. 


H. Weyl [1, pp. 43-44] 


The axioms of congruence, a sample of which was given in 1.26, can be 
extended in a natural manner from plane geometry to solid geometry. In 
space, an isometry (Weyl’s “congruence”) is still any transformation that
preserves length, so that a line segment PQ is transformed into a congruent
segment P’Q’. The most familiar examples are the rotation about a given line
through a given angle and the translation in a given direction through a given 
distance. In the former case the axis of rotation has all its points invariant; 
in the latter there is no invariant point, except when the distance is zero so 
that the translation is the identity. A reflection is the special kind of isom- 
etry which has a whole plane of invariant points: the mirror. By a sim- 
ple argument involving three spheres instead of two circles, we can easily 
prove the following analogue of Theorem 2.31: 


7.11 If an isometry has three non-collinear invariant points, it must be 
either the identity or a reflection. 


When two tetrahedra ABCP, ABCP’ are images of each other by reflection 
in their common face, we may regard the “broken line” formed by the three 
edges AB, BC, CP as a kind of rudimentary screw, and the image formed by
AB, BC, CP’ as an oppositely oriented screw: if one is right-handed the other
is left-handed. A model is easily made from two pieces of stiff wire, with
right-angled bends at B and C. In this manner the idea of sense can be
extended from two dimensions to three: we can say whether two given
congruent tetrahedra agree or disagree in sense. In the former case we shall
find that either tetrahedron can be moved (like a screw in its nut) to the posi- 
tion previously occupied by the other; such a motion is called a twist. 

This distinction arises in analytic geometry when we make a coordinate 
transformation. If O is the origin and X, Y, Z are at unit distances along 
the positive coordinate axes, the sense of the tetrahedron OXYZ determines
whether the system of axes is right-handed or left-handed. (A coordinate 
transformation determines an isometry transforming each point (x, y, z) into 
the point that has the same coordinates in the new system.) 

Since an isometry is determined by its effect on a tetrahedron, 


7.12 Any two congruent tetrahedra ABCD, A’B’C’D’ are related by a
unique isometry ABCD → A’B’C’D’, which is direct or opposite according as
the sense of A’B’C’D’ agrees or disagrees with that of ABCD.


(Some authors, such as Weyl, say “proper or improper” instead of “direct 
or opposite.”) 
The solid analogue of Theorem 3.12 is easily seen to be: 


7.13 Two given congruent triangles are related by just two isometries: one 
direct and one opposite. 


As a counterpart for 3.13 we have [Coxeter 1, p. 36]:


7.14 Every isometry is the product of at most four reflections. If there is an 
invariant point, “four” can be replaced by “three.”


Since a reflection reverses sense, an isometry is direct or opposite according
as it is the product of an even or odd number of reflections: 2 or 4 in
the former case, 1 or 3 in the latter. In particular, a direct isometry with
an invariant point is the product of just two reflections, and since the two
mirrors have a common point they have a common line. Hence 


7.15 Every direct isometry with an invariant point is a rotation. 
Also, as Euler observed in 1776, 


7.16 The product of two rotations about lines through a point O is another 
such rotation. 


EXERCISE 


The product of rotations through π about two intersecting lines that form an angle α
is a rotation through 2α.


7.2 THE CENTRAL INVERSION 


One of the most important opposite isometries is the central inversion (or
“reflection in a point”), which transforms each point P into the point P’ for 
which the midpoint of PP’ is a fixed point O. This can be described as the 
product of reflections in any three mutually perpendicular planes through O. 
Taking these three mirrors to be the coordinate planes x = 0, y = 0,z = 0, 
we see that the central inversion in the origin transforms each point (x, y, z)
into (−x, −y, −z).
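
This description can be verified with matrices. The Python sketch below is an added illustration; the helper names are hypothetical, and a reflection in a plane through the origin with unit normal n is assumed to be represented by the matrix I − 2nnᵀ acting on column vectors. It multiplies the reflections in the three coordinate planes and computes the determinant of the product, whose negative sign marks an opposite isometry.

```python
def reflection_matrix(n):
    """Matrix I - 2 n n^T of the reflection in the plane through the origin with unit normal n."""
    return [[(1.0 if i == j else 0.0) - 2.0 * n[i] * n[j] for j in range(3)]
            for i in range(3)]


def matmul(A, B):
    return [[sum(A[i][m] * B[m][j] for m in range(3)) for j in range(3)]
            for i in range(3)]


def apply(M, p):
    return tuple(sum(M[i][j] * p[j] for j in range(3)) for i in range(3))


def det3(M):
    return (M[0][0] * (M[1][1] * M[2][2] - M[1][2] * M[2][1])
            - M[0][1] * (M[1][0] * M[2][2] - M[1][2] * M[2][0])
            + M[0][2] * (M[1][0] * M[2][1] - M[1][1] * M[2][0]))


# Reflections in the coordinate planes x = 0, y = 0, z = 0.
Rx = reflection_matrix((1.0, 0.0, 0.0))
Ry = reflection_matrix((0.0, 1.0, 0.0))
Rz = reflection_matrix((0.0, 0.0, 1.0))

central_inversion = matmul(Rz, matmul(Ry, Rx))
print(apply(central_inversion, (1.0, 2.0, 3.0)), det3(central_inversion))
```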

The name “central inversion,” though well established in the literature of 
crystallography, is perhaps unfortunate: we must be careful to distinguish it 
from inversion in a sphere. 

For most purposes the central inversion plays the same role in three
dimensions as the half-turn in two. But we must remember that, since 3 is
an odd number, the central inversion is an opposite isometry whereas the
half-turn is direct. In space, the name half-turn is naturally used for the
rotation through π about a line (or the “reflection in a line”), which is still
direct [Lamb 1, p. 9].


EXERCISE 


What is the product of half-turns about three mutually perpendicular lines through 
a point? 


7.3 ROTATION AND TRANSLATION 


The treatment of translation in § 3.2 can be adapted to three dimensions 
by defining a translation as the product of two central inversions. We soon 
see that either the first center or the second may be arbitrarily assigned, and 
that the two inversions may be replaced by two half-turns about parallel axes 
or by two reflections in parallel mirrors. 

Thus the product of two reflections is either a translation or a rotation.
The latter arises when the two mirrors intersect in a line, the axis of the
rotation. In particular, the product of reflections in two perpendicular
mirrors is a half-turn.

The product of reflections in two planes through a line l, being a rotation
about l, is the same as the product of reflections in two other planes through
l making the same dihedral angle as the given planes (in the same sense).
Similarly, the product of reflections in two parallel planes, being a transla- 
tion, is the same as the product of reflections in two other planes parallel 
to the given planes and having the same distance apart. 


EXERCISE 


What is the product of reflections in three parallel planes? 


7.4 THE PRODUCT OF THREE REFLECTIONS 


The three simplest kinds of isometry, namely rotation, translation and re- 
flection, combine in commutative pairs to form the twist (or “screw dis- 
placement”), glide reflection and rotatory reflection. A twist is the product of 
a rotation with a translation along the direction of the axis. A glide reflection 
is the product of a reflection with a translation along the direction of a line 
lying in the mirror, that is, the product of reflections in three planes of which 
two are parallel while the third is perpendicular to both. A rotatory reflec- 
tion is the product of a reflection with a rotation whose axis is perpendicular 
to the mirror. When this rotation is a half-turn, the rotatory reflection re- 
duces to a central inversion. 

Any rotatory reflection can be analysed into a central inversion and a
residual rotation. For, if the rotation involved in the rotatory reflection is
a rotation through θ, we may regard it as the product of a half-turn and a
rotation through θ + π (or θ − π); the half-turn combines with the reflection
to form a central inversion. Thus a rotatory reflection can just as well
be called a rotatory inversion: the product of a central inversion and a
rotation whose axis passes through the center.
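As a numerical check of this analysis, the sketch below takes the mirror to be
z = 0 and the axis to be the z-axis (a choice of coordinates assumed here for
convenience) and compares the rotatory reflection through θ with the rotatory
inversion through θ + π. The helper names are hypothetical.

```python
import math


def rotate_about_z(p, theta):
    """Rotation through theta about the z-axis."""
    x, y, z = p
    c, s = math.cos(theta), math.sin(theta)
    return (x * c - y * s, x * s + y * c, z)


def reflect_in_z0(p):
    """Reflection in the plane z = 0, which is perpendicular to the axis."""
    x, y, z = p
    return (x, y, -z)


def central_inversion(p):
    """Central inversion in the origin."""
    x, y, z = p
    return (-x, -y, -z)


def rotatory_reflection(p, theta):
    """Rotation through theta about the z-axis, then reflection in z = 0."""
    return reflect_in_z0(rotate_about_z(p, theta))


def rotatory_inversion(p, theta):
    """Rotation through theta + pi about the z-axis, then central inversion."""
    return central_inversion(rotate_about_z(p, theta + math.pi))


def close(p, q, tol=1e-12):
    """True when two points agree to within rounding error."""
    return all(abs(a - b) <= tol for a, b in zip(p, q))
```

For any point p and angle theta, rotatory_reflection(p, theta) and
rotatory_inversion(p, theta) agree up to rounding error.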

Any opposite isometry T that has an invariant point O is either a single
reflection or the product of reflections in three planes through O. Its
product TI with the central inversion I in O, being a direct isometry with an
invariant point, is simply a rotation S about a line through O. Hence the given
opposite isometry is the rotatory inversion

$$ T = SI^{-1} = SI: $$

7.41 Every opposite isometry with an invariant point is a rotatory inversion.


Since three planes that have no common point are all perpendicular to one
plane α, the reflections in them (as applied to a point in α) behave like the
reflections in the lines that are their sections by α. Thus we can make use of
Theorem 3.31 and conclude that

7.42 Every opposite isometry with no invariant point is a glide reflection.


EXERCISES 


1. What is the product of reflections in three planes through a line? 

2. Let ABC and A’B’C’ be two congruent triangles in distinct planes. Consider the 
perpendicular bisectors of AA’, BB’, CC’. If these three planes have just one common 
point O, the two triangles are related by a rotatory inversion with center O. (Hint: If 
they were related by a rotation, the three planes would intersect in a line.) 


3. Every opposite isometry is expressible as the product of a reflection and a half- 
turn. 


7.5 TWIST 


The only remaining possibility is a direct isometry with no invariant point.
Let S be any direct isometry (with or without an invariant point),
transforming an arbitrary point A into A′. Let R₁ be the reflection that
interchanges A and A′. Then the product R₁S is an opposite isometry leaving A′
invariant. By 7.41, this is a rotatory inversion or rotatory reflection
R₂R₃R₄, the product of a rotation R₂R₃ and a reflection R₄, the mirror for R₄
being perpendicular to the axis for R₂R₃. Since this rotation may be expressed
as the product of two reflections in various ways (§ 7.3), we can adjust the
mirrors for R₂ and R₃ so as to make the former perpendicular to the mirror for
R₁. Since both these planes remain perpendicular to the mirror for R₄, we now
have

$$ S = R_1 R_2 R_3 R_4, $$

the product of the two rotations R₁R₂, R₃R₄, both of which are half-turns
[Veblen and Young 2, p. 318]:


7.51 Every direct isometry is expressible as the product of two half-turns. 


Figure 7.5a 


If the isometry has an invariant point, it is a rotation, which may be
expressed in various ways as the product of half-turns about two intersecting
lines. When there is no invariant point, the axes of the two half-turns are
either parallel, in which case the product is a translation, or skew, like two
opposite edges of a tetrahedron. Two skew lines always lie in a pair of
parallel planes, namely, the plane through each line parallel to the other.

Since a half-turn is the product of reflections in any two perpendicular
planes through its axis, the two half-turns R₁R₂, R₃R₄ with skew axes are
respectively equal to R′₁R′₂, R′₃R′₄, where the mirrors for R′₂ and R′₄ are
parallel while the other two are perpendicular to them (Figure 7.5a). Hence

$$ R_1 R_2 R_3 R_4 = R'_1 R'_2 R'_3 R'_4 = R'_1 R'_3 R'_2 R'_4, $$

where the interchange of the middle reflections is possible since the half-turn
R′₂R′₃ may be equally well expressed as R′₃R′₂. We have now fulfilled our
purpose of expressing the general direct isometry as a twist: the product
of the rotation R′₁R′₃ and the translation R′₂R′₄ along the axis of the
rotation. (This axis meets both the skew lines at right angles, and therefore
measures the shortest distance between them.) In other words,


7.52 Every displacement is either a rotation or a translation or a twist. 


(For an alternative treatment see Thomson and Tait [1, § 102].) 
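A concrete instance may help to fix ideas. In the following sketch, which is
only an illustration, the two skew axes are assumed to be the x-axis and the
line x = 0, z = d (parallel to the y-axis), so that the line of shortest
distance is the z-axis and the shortest distance is d. Products are read from
left to right, as in the text: the first half-turn is applied first.

```python
def half_turn_about_x_axis(p):
    """Half-turn about the x-axis."""
    x, y, z = p
    return (x, -y, -z)


def half_turn_about_raised_line(p, d):
    """Half-turn about the line x = 0, z = d, which is parallel to the y-axis."""
    x, y, z = p
    return (-x, y, 2 * d - z)


def product_of_half_turns(p, d):
    """First the half-turn about the x-axis, then the one about the raised line."""
    return half_turn_about_raised_line(half_turn_about_x_axis(p), d)


def twist_along_z(p, d):
    """Half-turn about the z-axis combined with a translation through 2d along it."""
    x, y, z = p
    return (-x, -y, z + 2 * d)


print(product_of_half_turns((1, 2, 3), 5))  # (-1, -2, 13)
print(twist_along_z((1, 2, 3), 5))          # (-1, -2, 13)
```

The two functions agree for every point, so in this special case the product
of the half-turns is a twist whose axis is the common perpendicular of the two
skew lines.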


EXERCISES 
1. What kind of isometry transforms the point (x, y, z) into
(a) (x, y, −z), (b) (−y, x, z), (c) (x, y, z + 1),
(d) (−y, x, z + 1), (e) (−x, y, z + 1), (f) (−y, x, −z)?


2. The product of half-turns about two skew lines at right angles is a twist, namely, 
the product of a half-turn about the line of shortest distance and a translation through 
twice this shortest distance. (Veblen and Young [2, p. 324] named this a half twist.) 


7.6 DILATIVE ROTATION 


It can be proved by elementary methods that every Euclidean similarity 
other than a rigid motion has a fixed point. 


Hilbert and Cohn-Vossen [1, p. 331]


In Euclidean space, the definition of dilatation is exactly the same as in the
plane. In fact, § 5.1 can be applied, word for word, to three dimensions,
except that the special dilatation AB → BA or O(−1) is not a half-turn but a
central inversion (§ 7.2). Likewise, § 5.2 applies to spheres just as well as
to circles: Figure 5.2a may be regarded as a plane section of two unequal
spheres with their centers C, C′ and their centers of similitude O, O₁. Two
equal spheres are related by a translation and by a central inversion.

However, an important difference appears when we consider questions of
sense. In the plane, every dilatation is direct, but in space the dilatation
O(λ) is direct or opposite according as λ is positive or negative; for example,
the central inversion O(−1) is opposite, as we have seen.

In space, as in the plane, two similar figures are related by a similarity,
which in special cases may be an isometry or a dilatation. By a natural
extension of the terminology we now take a dilative rotation to mean the
product of a rotation about a line l (the axis) and a dilatation whose center O
lies on l. The plane through O perpendicular to l is invariant, being
transformed according to the two-dimensional “dilative rotation” of § 5.5. In
the special case when the rotation about l is a half-turn, there are infinitely
many other invariant planes, namely all the planes through l. Any such plane is
transformed according to a dilative reflection.

Suppose a dilative rotation is the product of a rotation through angle α
and a dilatation O(λ) (where O lies on the axis). The following values of α
and λ yield special cases which are familiar:


    α      λ      Similarity

    0      1      Identity
    π      1      Half-turn
    α      1      Rotation
    π     −1      Reflection
    0     −1      Central inversion
    α     −1      Rotatory inversion
    0      λ      Dilatation


We observe that this table includes all kinds of isometry, both direct and op- 
posite, except the translation, twist and glide reflection (which have no in- 
variant points). Still more surprisingly, we shall find that, with these same 
three exceptions, every similarity is a dilative rotation. 
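The entries of the table can be reproduced in coordinates. The sketch below
assumes that the center is the origin and the axis is the z-axis, so that the
dilative rotation with angle α and ratio λ is the rotation through α about the
z-axis followed by multiplication of all three coordinates by λ. The function
names and the sample values are ours.

```python
import math


def dilative_rotation(p, alpha, lam):
    """Rotation through alpha about the z-axis, then the dilatation O(lam)."""
    x, y, z = p
    c, s = math.cos(alpha), math.sin(alpha)
    return (lam * (x * c - y * s), lam * (x * s + y * c), lam * z)


def close(p, q, tol=1e-12):
    """True when two points agree to within rounding error."""
    return all(abs(a - b) <= tol for a, b in zip(p, q))


p = (1.0, 2.0, 3.0)
reflection = dilative_rotation(p, math.pi, -1)   # approximately (1, 2, -3)
inversion = dilative_rotation(p, 0.0, -1)        # approximately (-1, -2, -3)
half_turn = dilative_rotation(p, math.pi, 1)     # approximately (-1, -2, 3)
```

With α = π and λ = −1 only the z-coordinate changes sign, which is the
reflection in the plane through O perpendicular to the axis.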

The role of similar triangles is now taken over by similar tetrahedra. 
Evidently 


7.61 Two given similar tetrahedra ABCD, A′B′C′D′ are related by a
unique similarity ABCD → A′B′C′D′, which is direct or opposite according as
the sense of A′B′C′D′ agrees or disagrees with that of ABCD.


In other words, a similarity is completely determined by its effect on any 
four given non-coplanar points, and we have the following generalization of 
Theorem 7.13: 


7.62 Two given similar triangles ABC, A′B′C′ are related by just two
similarities: one direct and one opposite.


As a step towards proving that every similarity which is not an isometry 
is a dilative rotation, let us first prove 


7.63 Every similarity which is not an isometry has just one invariant point.

Consider any given similarity S, whose ratio of magnification is μ ≠ 1.
Let S transform an arbitrary point A into A′. If A′ coincides with A, we have
the desired invariant point. If not, let Q be the point that divides the
segment AA′ in the ratio 1 : μ, externally or internally according as S is
direct or opposite: that is, construct Q so that QA′ = ±μQA. Let D denote the
direct or opposite dilatation Q(±μ⁻¹). Then SD, having ratio of magnification
1, is a direct isometry which leaves A invariant. By Theorem 7.15, SD is a
rotation about some line l through A. The plane through Q perpendicular to l
is transformed into itself by both SD and D⁻¹ and, therefore, also by their
product S. On this invariant plane, S induces a two-dimensional similarity
which, by Theorem 5.42, has an invariant point. Finally, this invariant point
is unique, for if there were two distinct invariant points, the segment formed
by them would be invariant instead of being multiplied by μ.
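The existence of the invariant point can also be seen numerically, though not
by the method of the proof above. When μ < 1 the similarity shrinks all
distances, so repeated application to any starting point converges to the
invariant point (for μ > 1 the same could be done with the inverse
similarity). The sketch below assumes a particular similarity chosen for
illustration: a quarter-turn about the z-axis, the dilatation O(½), and a
translation through (1, 0, 2).

```python
def similarity(p):
    """Hypothetical example: quarter-turn about the z-axis, ratio 1/2, then a shift."""
    x, y, z = p
    mu = 0.5
    return (mu * (-y) + 1.0, mu * x, mu * z + 2.0)


def invariant_point(f, start=(0.0, 0.0, 0.0), steps=200):
    """Iterate a distance-shrinking map; the iterates approach its fixed point."""
    p = start
    for _ in range(steps):
        p = f(p)
    return p


def close(p, q, tol=1e-9):
    """True when two points agree to within rounding error."""
    return all(abs(a - b) <= tol for a, b in zip(p, q))


center = invariant_point(similarity)  # approximately (0.8, 0.4, 4.0)
```

Solving x = 1 − y/2, y = x/2, z = z/2 + 2 by hand gives the same point
(0.8, 0.4, 4), which serves as a check on the iteration.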


Having found the invariant point (or center) O, we can carry out a
simplified version of the above procedure, with O for A. Since A′ and Q both
coincide with O, S is the product of a rotation about a line through O and
the dilatation O(±μ), that is to say, S is a dilative rotation:


7.64 Every similarity is either an isometry or a dilative rotation. 


In other words, every similarity is either a translation, a twist, a glide re- 
flection, or a dilative rotation, provided we regard the last possibility as in- 
cluding all the special cases tabulated in the middle of page 102. 


By Theorem 7.62, there are two dilative rotations, one direct and one
opposite, which will transform a given triangle ABC into a similar (but not
congruent) triangle A′B′C′. The ratio of magnification, μ ≠ 1, is given by
the equation A′B′ = μAB. Let A₁ and A₂ divide AA′ internally and
externally in the ratio 1 : μ. Let B₁ and B₂, C₁ and C₂ divide BB′, CC′ in the
same manner. Consider the three spheres whose diameters are A₁A₂, B₁B₂,
C₁C₂. These are “spheres of Apollonius” (Theorem 6.81); for example,
the first is the locus of points whose distances from A and A′ are in the ratio
1 : μ. Any point O for which

$$ OA' = \mu\,OA, \quad OB' = \mu\,OB, \quad OC' = \mu\,OC $$

must lie on all three spheres. We have already established the existence of
two such points. Hence the centers of the two dilative rotations may be
constructed as the points of intersection of these three spheres.


In § 3.7 we used a translation to generate a geometric representation of the
infinite cyclic group C∞ (which is the free group with one generator). We
see now that the same abstract group has a more interesting representation in
which the generator is a dilative rotation. Some thirty elements of this group
can be seen in the Nautilus shell [Thompson 2, p. 843, Figure 418].


EXERCISES


1. How is the point (x, y, z) transformed by the general dilative rotation whose cen- 
ter and axis are the origin and the z-axis? 
2. Find the axis and angle for the dilative rotation

$$ (x, y, z) \to (\mu z, \mu x, \mu y). $$

3. How does the above classification of similarities deal with the
three-dimensional “dilative reflection”: the product of a dilatation O(λ) and
the reflection in a plane through O?

4. Could Theorem 7.63 be proved in the manner of the exercise at the end of §5.4 
(page 73)? 


7.7 SPHERE-PRESERVING TRANSFORMATIONS 


The reasoning used in § 6.7 extends readily from two to three dimensions, 
yielding the following analog* of Theorem 6.71: 


7.71 Every sphere-preserving transformation of inversive space is either a
similarity or the product of an inversion (in a sphere) and an isometry.


EXERCISE


Every sphere-preserving transformation can be expressed as the product of r reflec- 
tions and s inversions, where 


* René Lagrange, Produits d’inversions et métrique conforme, Cahiers
scientifiques, 23 (Gauthier-Villars, Paris, 1957), p. 7. See also Coxeter,
Annali di Matematica pura ed applicata, 53 (1961), pp. 165-172.


Coordinates


In the preceding chapters, a few exercises on coordinates have been
inserted for the sake of those readers who are already acquainted with analytic
geometry. Other readers, having omitted such exercises, are awaiting
enlightenment at the present stage. In addition to the usual rectangular
Cartesian coordinates, we shall consider oblique and polar coordinates. (The
polar equation for an ellipse is important because of its use in the theory of
orbits.) After a brief mention of special curves we shall give an outline of
Newton’s application of calculus to problems of arc length and area. The
section on three-dimensional space culminates in a surprising property of
the doughnut-shaped torus.


8.1 CARTESIAN COORDINATES 


Though the idea behind it all is childishly simple, yet the method of 
analytic geometry is so powerful that very ordinary boys of seventeen 
can use it to prove results which would have baffled the greatest of 
the Greek geometers—Euclid, Archimedes, and Apollonius. 


E. T. Bell (1883-1960)
[E. T. Bell 1, p. 21] 


Analytic geometry may be described as the representation of the points 
in n-dimensional space by ordered sets of n (or more) numbers called co- 
ordinates. For instance, any position on the earth can be specified by its 
latitude, longitude, and height above sea level. 

The one-dimensional case is well illustrated by a thermometer. There is
a certain point on the line associated with the number 0; the positive
integers 1, 2, 3, ... are evenly spaced in one direction away from 0, the
negative integers −1, −2, −3, ... in the opposite direction, and the fractional
numbers are interpolated in the natural manner. The displacement from
one point x to another point x′ is the positive or negative number x′ − x.

In the two-dimensional case, the position of a point in a plane may be
specified by its distances from two fixed perpendicular lines, the axes. This 
notion can be traced back over two thousand years to Archimedes of Syra- 
cuse and Apollonius of Perga, or even to the ancient Egyptians; but it was 
first developed systematically by two Frenchmen: Pierre Fermat (whose 
problem about a triangle we solved in § 1.8) and René Descartes (1596-1650). 
In their formulation the two distances were taken to be positive or zero. The 
important idea of allowing one or both to be negative was supplied by Sir 
Isaac Newton (1642-1727), and it was G. W. Leibniz (1646-1716) who first 
called them “coordinates.” (The Germans write Koordinaten, the French 
coordonnées. ) 


Figure 8.1a 


For some purposes it is just as easy to use oblique axes, as in the second
part of Figure 8.1a. Starting from the origin O, where the axes intersect,
we reach the general point (x, y) by going a distance x along the x-axis OX 
and then a distance y along a line parallel to the y-axis OY. The x-axis is 
said to have the equation y = 0 because every point (x, 0) satisfies this equa- 
tion; similarly, x = 0 is the equation of the y-axis. On any other line through 
the origin, consideration of homothetic triangles shows that the ratio y/x is 
constant; thus any line through the origin (0, 0) may be expressed as 
ax + by = 0. 

To obtain the equation for any other line, we take a point (x₁, y₁) on it. In
terms of new coordinates x′, y′, derived by translating the origin from (0, 0)
to (x₁, y₁), the line may be expressed as ax′ + by′ = 0. Since x′ = x − x₁
and y′ = y − y₁, the same line, in terms of the original coordinates, is

$$ a(x - x_1) + b(y - y_1) = 0 $$

or, say,

$$ ax + by + c = 0. \tag{8.11} $$


Thus every line has a linear equation, and every linear equation determines a
line. In particular, the line that makes intercepts p and q on the axes is

$$ \frac{x}{p} + \frac{y}{q} = 1; \tag{8.12} $$

for, this equation is linear and is satisfied by both (p, 0) and (0, q). Two
lines of the form 8.11 are parallel if they have the same ratio a/b
(including, as one possibility, b = 0 for both, in which case they are parallel
to the y-axis). The point of intersection of two nonparallel lines is obtained
by solving the two simultaneous equations for x and y.
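The recipe of solving the two simultaneous equations can be written out
explicitly by Cramer's rule. In this sketch a line ax + by + c = 0 is
represented by the triple (a, b, c); the function names are assumptions made
for the example.

```python
def line_from_intercepts(p, q):
    """The line x/p + y/q = 1 (8.12), written as a triple (a, b, c)."""
    return (1.0 / p, 1.0 / q, -1.0)


def intersection(line1, line2):
    """Common point of two lines ax + by + c = 0, or None if they are parallel."""
    a1, b1, c1 = line1
    a2, b2, c2 = line2
    det = a1 * b2 - a2 * b1   # vanishes exactly when the ratios a/b agree
    if det == 0:
        return None
    x = (b1 * c2 - b2 * c1) / det
    y = (a2 * c1 - a1 * c2) / det
    return (x, y)


print(intersection((1, 1, -2), (1, -1, 0)))  # (1.0, 1.0)
print(intersection((1, 1, 0), (2, 2, 5)))    # None
```

The determinant test is the coordinate form of the criterion for parallelism
stated above, including the case b = 0 for both lines.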

If b ≠ 0, the equation 8.11 may be solved for y in the form y =
−(ax + c)/b. More generally, points whose coordinates satisfy an equation
F(x, y) = 0 or y = f(x) can be plotted by giving convenient values to the
abscissa x and calculating the corresponding values of the ordinate y. This
procedure is particularly appropriate when f(x) is a one-valued function of
x. In other cases we may prefer to use parametric equations, expressing x
and y as functions of a single variable (or parameter) t. For instance, if P₁
denotes the point (x₁, y₁), any line through P₁ has parametric equations

$$ x = x_1 + Xt, \quad y = y_1 + Yt, \tag{8.13} $$


where X and Y depend on the direction of the line. 

Sometimes, for the sake of symmetry, the single parameter t is replaced
by two parameters, t₁ and t₂, related by an auxiliary equation. For instance,
the general point (x, y) on the line through two given points P₁ and P₂
is given by

$$ x = t_1 x_1 + t_2 x_2, \quad y = t_1 y_1 + t_2 y_2, \quad t_1 + t_2 = 1. $$

This point P, dividing the segment P₁P₂ in the ratio t₂ : t₁, is the centroid
(or “center of gravity”) of masses t₁ at P₁ and t₂ at P₂. Positions outside the
interval from P₁ (where t₂ = 0) to P₂ (where t₁ = 0) are covered by allowing t₂
or t₁ to be negative, while still satisfying t₁ + t₂ = 1; we may justify this
by calling them “electric charges” instead of “masses.”

For problems involving the distance between two points or the angle
between two lines, it is often advisable to use rectangular axes, so that the
distance from the origin to (x, y) is the square root of x² + y², and the
distance P₁P₂ is the square root of

$$ (x_1 - x_2)^2 + (y_1 - y_2)^2. $$
Multiplication of the expression l = ax + by + c by a suitable number
enables us to normalize the equation l = 0 of the general line so that
a² + b² = 1. Writing l = 0 in the form

$$ (x - x_1 + 2al_1)^2 + (y - y_1 + 2bl_1)^2 = (x - x_1)^2 + (y - y_1)^2, $$

where l₁ = ax₁ + by₁ + c, we recognize it as the locus of points equidistant
from

$$ (x_1 - 2al_1,\; y_1 - 2bl_1) \quad \text{and} \quad (x_1, y_1); $$

in other words, the line l = 0 serves as a mirror which interchanges these two
points by reflection. It follows that the foot of the perpendicular from P₁ to
l = 0 is (x₁ − al₁, y₁ − bl₁), and that the distance from P₁ to the line is ±l₁
(provided a² + b² = 1). In particular, the distance from the origin to l = 0
is ±c.
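These formulas translate directly into a short program. The sketch below
(with hypothetical function names) first normalizes the triple (a, b, c) so
that a² + b² = 1 and then evaluates l₁, the foot of the perpendicular, and the
mirror image of a given point.

```python
import math


def normalize(line):
    """Scale (a, b, c) so that a*a + b*b = 1."""
    a, b, c = line
    n = math.hypot(a, b)
    return (a / n, b / n, c / n)


def signed_distance(line, p):
    """The value l1 = a*x1 + b*y1 + c for the normalized line; |l1| is the distance."""
    a, b, c = normalize(line)
    return a * p[0] + b * p[1] + c


def foot_of_perpendicular(line, p):
    """The point (x1 - a*l1, y1 - b*l1)."""
    a, b, _ = normalize(line)
    l1 = signed_distance(line, p)
    return (p[0] - a * l1, p[1] - b * l1)


def mirror_image(line, p):
    """The point (x1 - 2*a*l1, y1 - 2*b*l1), the reflection of p in the line."""
    a, b, _ = normalize(line)
    l1 = signed_distance(line, p)
    return (p[0] - 2 * a * l1, p[1] - 2 * b * l1)
```

For the line 3x + 4y − 25 = 0 and the origin, for example, l₁ = −5, the foot of
the perpendicular is (3, 4), and the mirror image is (6, 8), up to rounding.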
The locus of points at unit distance from the origin is the circle

$$ x^2 + y^2 = 1, $$

which has the parametric equations

$$ x = \cos\theta, \quad y = \sin\theta $$

or, with t = tan ½θ,

$$ x = \frac{1 - t^2}{1 + t^2}, \quad y = \frac{2t}{1 + t^2}. $$

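One pleasant consequence of the second parametrization is that rational values
of t yield points with rational coordinates. The following sketch, using
Python's exact Fraction type, is offered as an illustration; the name
circle_point is ours.

```python
from fractions import Fraction


def circle_point(t):
    """Point ((1 - t^2)/(1 + t^2), 2t/(1 + t^2)) of the unit circle, computed exactly."""
    t = Fraction(t)
    d = 1 + t * t
    return ((1 - t * t) / d, 2 * t / d)


x, y = circle_point(Fraction(1, 2))
print(x, y)           # 3/5 4/5
print(x * x + y * y)  # 1
```

The value t = ½ thus gives the point (3/5, 4/5), which corresponds to the
familiar right-angled triangle with sides 3, 4, 5.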
EXERCISES 


1. In terms of general Cartesian coordinates, the point (x, y) will be
transformed into
(−x, −y) by the half-turn O(−1) (§ 5.1),
(μx, μy) by the dilatation O(μ),
(x + a, y) by a translation along the x-axis.
2. In terms of rectangular Cartesian coordinates, the point (x, y) will be
transformed into
(x, −y) by reflection in the x-axis,
(y, x) by reflection in the line x = y,
(−y, x) by a quarter-turn about the origin,
(x + a, −y) by a glide reflection (in and along the x-axis),
(μx, −μy) by a dilative reflection (§ 5.6).
3. Let Mᵢⱼ denote the midpoint of PᵢPⱼ. For any four points P₁, P₂, P₃, P₄, the
midpoints of M₁₂M₃₄, M₁₃M₂₄, M₁₄M₂₃ all coincide.


8.2 POLAR COORDINATES 


The deriving of short cuts from basic principles covers some of the finest 
achievements of the greatest mathematicians. 


M. H. A. Newman (1897- )
(Mathematical Gazette 43 (1959), p. 170) 


For problems involving directions from a fixed origin (or “pole”) O, we
often find it convenient to specify a point P by its polar coordinates (r, θ),
where r is the distance OP and θ is the angle that the direction OP makes
with a given initial line OX, which may be identified with the x-axis of
rectangular Cartesian coordinates. Of course, the point (r, θ) is the same as
(r, θ + 2nπ) for any integer n. It is sometimes desirable to allow r to be
negative, so that (r, θ) is the same as (−r, θ + π).




Given the Cartesian equation for a curve, we can deduce the polar equation
for the same curve by substituting

$$ x = r\cos\theta, \quad y = r\sin\theta. \tag{8.21} $$

For instance, the unit circle x² + y² = 1 has the polar equation

$$ (r\cos\theta)^2 + (r\sin\theta)^2 = 1, $$

which reduces to

$$ r = 1. $$

(The positive value of r is sufficient if we allow θ to take all values from −π
to π or from 0 to 2π.) This procedure is helpful in elementary trigonometry,
where students often experience some difficulty in proving (and remembering)
the trigonometrical functions of obtuse and larger angles. Taking an
angle XOP with OP = 1, we can simply define its cosine and sine to be the
abscissa and ordinate of P.

Polar coordinates are particularly suitable for describing those isometries
(§ 3.5) and similarities (§ 5.4) which have an invariant point; for this point
may be used as the origin. Thus the general point (r, θ) will be transformed
into

(r, θ + α) by a rotation through α,
(r, θ + π) by a half-turn,
(r, −θ) by reflection in the initial line,
(r, 2α − θ) by reflection in the line θ = α,
(μr, θ) by the dilatation O(μ),
(μr, θ + α) by a dilative rotation with center O,
(μr, 2α − θ) by a dilative reflection with center O and axis θ = α.

Likewise, inversion in the circle r = k (see § 6.1) will transform (r, θ) into
(k²/r, θ).
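The list above is easily put into executable form. In this sketch a point is
a pair (r, theta) of polar coordinates with pole O; the function names are
ours, and to_cartesian applies 8.21.

```python
import math


def to_cartesian(point):
    """Apply 8.21: x = r cos(theta), y = r sin(theta)."""
    r, theta = point
    return (r * math.cos(theta), r * math.sin(theta))


def rotate(point, alpha):
    """Rotation through alpha about O."""
    r, theta = point
    return (r, theta + alpha)


def reflect_in_line(point, alpha):
    """Reflection in the line theta = alpha."""
    r, theta = point
    return (r, 2 * alpha - theta)


def dilate(point, mu):
    """The dilatation O(mu)."""
    r, theta = point
    return (mu * r, theta)


def invert(point, k):
    """Inversion in the circle r = k (defined for r != 0)."""
    r, theta = point
    return (k * k / r, theta)
```

For instance, reflect_in_line((1.0, 0.0), math.pi / 4) gives a point whose
Cartesian coordinates are (0, 1) up to rounding, in agreement with the
reflection of (1, 0) in the line x = y.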

The Cartesian expressions for the same transformations can be deduced
at once. For instance, the rotation through α about O transforms (x, y)
into (x′, y′) where, by 8.21,

$$ x' = r\cos(\theta + \alpha) = r(\cos\theta\cos\alpha - \sin\theta\sin\alpha) = x\cos\alpha - y\sin\alpha, $$

$$ y' = r\sin(\theta + \alpha) = r(\cos\theta\sin\alpha + \sin\theta\cos\alpha) = x\sin\alpha + y\cos\alpha. $$

In particular, a quarter-turn transforms (x, y) into (−y, x), and it follows
that a necessary and sufficient condition for two points (x, y) and (x′, y′) to
lie in perpendicular directions from the origin is

$$ xx' + yy' = 0. \tag{8.22} $$
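A brief sketch of these Cartesian expressions follows; the names rotate,
quarter_turn and perpendicular are assumptions of the example, and integer
coordinates are used in the perpendicularity test so that 8.22 can be checked
exactly.

```python
import math


def rotate(x, y, alpha):
    """Rotation through alpha about the origin, in rectangular coordinates."""
    c, s = math.cos(alpha), math.sin(alpha)
    return (x * c - y * s, x * s + y * c)


def quarter_turn(x, y):
    """The special case alpha = pi/2, written without rounding error."""
    return (-y, x)


def perpendicular(p, q):
    """Condition 8.22 for p and q to lie in perpendicular directions from the origin."""
    return p[0] * q[0] + p[1] * q[1] == 0


print(quarter_turn(3, 4))                         # (-4, 3)
print(perpendicular((3, 4), quarter_turn(3, 4)))  # True
```

The general function rotate(3, 4, math.pi / 2) returns the same point as
quarter_turn(3, 4) apart from rounding in the last decimal places.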

Such a transformation as

$$ x' = x\cos\alpha - y\sin\alpha, \qquad y' = x\sin\alpha + y\cos\alpha \tag{8.23} $$

has two distinct aspects: an “active” or alibi aspect, in which each point
(x, y) is moved to a new position (x′, y′), and a “passive” or alias aspect,
in which the point previously named (x, y) is renamed (x′, y′). The latter
aspect is sometimes used to simplify the equation of a given curve. For
instance, the curve


$$ ax^2 + 2hxy + by^2 = 1 $$

becomes

$$ a(x\cos\alpha - y\sin\alpha)^2 + 2h(x\cos\alpha - y\sin\alpha)(x\sin\alpha + y\cos\alpha) + b(x\sin\alpha + y\cos\alpha)^2 = 1, $$

in which the coefficient of xy is no longer 2h but

$$ 2h(\cos^2\alpha - \sin^2\alpha) - 2(a - b)\cos\alpha\sin\alpha = 2h\cos 2\alpha - (a - b)\sin 2\alpha. $$

Since this vanishes when tan 2α = 2h/(a − b), the equation is simplified
by rotating the axes through the particular angle

$$ \alpha = \tfrac{1}{2}\arctan\frac{2h}{a - b}. $$
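The computation can be sketched as follows. The function rotate_conic (a
hypothetical name) returns the coefficients of the transformed equation in the
form Ax² + 2Hxy + By² = 1, and principal_angle uses atan2 rather than a plain
arctangent so that the case a = b causes no division by zero. The sample curve
5x² + 6xy + 5y² = 1 is chosen for illustration.

```python
import math


def rotate_conic(a, h, b, alpha):
    """Coefficients (A, H, B) after substituting 8.23 into a x^2 + 2h xy + b y^2 = 1."""
    c, s = math.cos(alpha), math.sin(alpha)
    A = a * c * c + 2 * h * c * s + b * s * s
    H = h * (c * c - s * s) - (a - b) * c * s   # half the new coefficient of xy
    B = a * s * s - 2 * h * c * s + b * c * c
    return (A, H, B)


def principal_angle(a, h, b):
    """An angle alpha satisfying tan(2 alpha) = 2h / (a - b)."""
    return 0.5 * math.atan2(2 * h, a - b)


alpha = principal_angle(5, 3, 5)          # pi/4, since a = b
A, H, B = rotate_conic(5, 3, 5, alpha)    # approximately (8, 0, 2)
```

The sample curve therefore becomes 8x² + 2y² = 1 after the axes are rotated
through 45°; note that A + B = a + b, as it must be for a rotation.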


The area of a triangle OP₁P₂, where Pᵢ has polar coordinates (rᵢ, θᵢ), is
taken to be positive if θ₁ < θ₂, negative if θ₁ > θ₂. With this convention,
the area is

$$ \tfrac{1}{2} r_1 r_2 \sin(\theta_2 - \theta_1) = \tfrac{1}{2} r_1 r_2 (\sin\theta_2\cos\theta_1 - \cos\theta_2\sin\theta_1) $$

or, in Cartesian coordinates,

$$ \tfrac{1}{2}(x_1 y_2 - x_2 y_1) = \tfrac{1}{2} \begin{vmatrix} x_1 & y_1 \\ x_2 & y_2 \end{vmatrix}. \tag{8.24} $$

To find the area of any triangle P₁P₂P₃, we choose new axes parallel to OX,
OY and passing through P₃. Since the new coordinates of Pᵢ (i = 1 or 2)
are (xᵢ − x₃, yᵢ − y₃), the area of any triangle P₁P₂P₃ is

$$ \tfrac{1}{2} \begin{vmatrix} x_1 - x_3 & y_1 - y_3 \\ x_2 - x_3 & y_2 - y_3 \end{vmatrix} = \tfrac{1}{2} \begin{vmatrix} x_1 & y_1 & 1 \\ x_2 & y_2 & 1 \\ x_3 & y_3 & 1 \end{vmatrix}. \tag{8.25} $$

It follows that a necessary and sufficient condition for P₁, P₂, P₃ to be
collinear is that this three-rowed determinant should be zero. The equation
for the line P₁P₂ may be derived from this condition by writing (x, y) for
(x₃, y₃).
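Formula 8.25 and the collinearity criterion can be sketched in a few lines.
Exact rational arithmetic is assumed here (via Fraction) so that the test for
a vanishing determinant is not disturbed by rounding; the function names are
ours.

```python
from fractions import Fraction


def signed_area(p1, p2, p3):
    """Half the two-rowed determinant of 8.25; positive for counterclockwise order."""
    (x1, y1), (x2, y2), (x3, y3) = p1, p2, p3
    det = (x1 - x3) * (y2 - y3) - (x2 - x3) * (y1 - y3)
    return Fraction(1, 2) * det


def collinear(p1, p2, p3):
    """True when the determinant of 8.25 vanishes."""
    return signed_area(p1, p2, p3) == 0


print(signed_area((1, 0), (0, 1), (0, 0)))   # 1/2
print(collinear((0, 0), (1, 1), (2, 2)))     # True
```

Interchanging two of the points reverses the sign of the result, in accordance
with the sign convention adopted for the triangle OP₁P₂.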


EXERCISES 


1. Use a well-known trigonometric formula to obtain an expression for the
square of the distance between the points whose polar coordinates are
(r₁, θ₁), (r₂, θ₂).
2. Obtain polar coordinates for the midpoint.
3. Obtain a polar equation for the line y = x tan α. (Hint: Allow r to take
negative values.)


4. Use 8.22 to obtain the condition

$$ aa' + bb' = 0 $$

for two lines ax + by + c = 0 and a′x + b′y + c′ = 0 to be perpendicular.
Deduce that

$$ ax + by + c = 0, \quad bx - ay + c' = 0 $$

are perpendicular for all values of a, b, c, c′.
5. Use a suitable rotation of axes to simplify the equation of the curve

$$ 4x^2 + 24xy + 11y^2 = 5. $$

8.3 THE CIRCLE 


A figure of universal appeal. 
D. Pedoe [1, p. vii] 


The circle with center (x′, y′) and radius k, being the locus of points (x, y)
distant k from (x′, y′), is

$$ (x - x')^2 + (y - y')^2 = k^2. $$

Thus

$$ x^2 + y^2 + 2gx + 2fy + c = 0 \tag{8.31} $$

is a circle with center (−g, −f) whenever g² + f² > c. If (x₁, y₁) lies on
the circle, the tangent at this point P₁ is

$$ x_1 x + y_1 y + g(x + x_1) + f(y + y_1) + c = 0 $$

or

$$ (x_1 + g)x + (y_1 + f)y + (gx_1 + fy_1 + c) = 0. $$

For this line passes through P₁ and is perpendicular to the diameter

$$ \frac{x + g}{x_1 + g} = \frac{y + f}{y_1 + f}. $$


The circle 8.31 is orthogonal to another circle

$$ x^2 + y^2 + 2g'x + 2f'y + c' = 0 $$

if, for a suitable P₁, the center of each lies on the tangent at P₁ to the
other. Adding

$$ (x_1 + g)g' + (y_1 + f)f' = gx_1 + fy_1 + c $$

to the analogous relation with primed and unprimed letters interchanged,
we see that the orthogonality of the two circles implies


$$ 2gg' + 2ff' = c + c'. $$

Conversely, any two circles that satisfy this relation are orthogonal. In
particular, the circles

$$ x^2 + y^2 + 2gx + c = 0, \tag{8.32} $$

$$ x^2 + y^2 + 2fy - c = 0, \tag{8.33} $$

whose centers lie on the x- and y-axes respectively, are orthogonal. Keeping
c constant and allowing g or f to take various values, we obtain two
orthogonal pencils of coaxal circles, whose radical axes are x = 0 and y = 0
respectively. If c = 0, we have two orthogonal tangent pencils, each
consisting of all the circles that touch one of the axes at the origin. If
c > 0, the circles 8.32, for various values of g, form a nonintersecting
pencil, including the two point circles

$$ (x + g)^2 + y^2 = 0, \quad g = \pm\sqrt{c}, $$

which are the limiting points (∓√c, 0) of the pencil. The circles 8.33, which
pass through these two points, form the orthogonal intersecting pencil.
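The orthogonality condition is easy to test by machine. In the sketch below a
circle 8.31 is represented by the triple (g, f, c), an encoding assumed for
the example; the second function checks the same property in the more familiar
form that the square of the distance between the centers equals the sum of the
squares of the radii.

```python
def center_and_radius_squared(circle):
    """Center (-g, -f) and squared radius g^2 + f^2 - c of the circle 8.31."""
    g, f, c = circle
    return ((-g, -f), g * g + f * f - c)


def orthogonal(circle1, circle2):
    """The condition 2gg' + 2ff' = c + c'."""
    g1, f1, c1 = circle1
    g2, f2, c2 = circle2
    return 2 * g1 * g2 + 2 * f1 * f2 == c1 + c2


def orthogonal_by_distance(circle1, circle2):
    """Squared distance between centers equals the sum of the squared radii."""
    (x1, y1), r1_sq = center_and_radius_squared(circle1)
    (x2, y2), r2_sq = center_and_radius_squared(circle2)
    return (x1 - x2) ** 2 + (y1 - y2) ** 2 == r1_sq + r2_sq


first = (3, 0, 4)     # a circle 8.32 with g = 3, c = 4
second = (0, 5, -4)   # a circle 8.33 with f = 5 and the same c
print(orthogonal(first, second), orthogonal_by_distance(first, second))  # True True
```

With integer coefficients both tests are exact, and they agree for every pair
of circles because the two conditions are algebraically equivalent.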
EXERCISES


1. The circle x² + y² = k² inverts (x, y) into

$$ \left( \frac{k^2 x}{x^2 + y^2},\; \frac{k^2 y}{x^2 + y^2} \right). $$

Apply this inversion to the line 8.11 and to the circle 8.31.

2. Find the locus of a point (x, y) whose distances from (k/μ, 0) and (μk, 0)
are in the ratio 1 : μ (cf. § 6.6).

3. Obtain the Cartesian equation of the locus of a point the product of whose
distances from (a, 0) and (−a, 0) is a². Deduce the polar equation of this
“figure of eight,” which is the lemniscate of Jacob Bernoulli.

4. Given two equal circles in contact, find the locus of the vertices of triangles for 
which the first is the nine-point circle (§ 1.7), the second is an excircle (§ 1.5). (Answer: 
A lemniscate.*) 

5. A circle of radius b rolls without sliding on the outside of a fixed circle
of radius nb. The locus of a point fixed on the circumference of the rolling
circle is called an epicycloid (when n is an integer, an n-cusped epicycloid).
Obtain the parametric equations

$$ x = (n + 1)b\cos t - b\cos(n + 1)t, \qquad y = (n + 1)b\sin t - b\sin(n + 1)t. \tag{8.34} $$

Sketch the cases n = 1 (the cardioid), n = 2 (the nephroid), n = 3, and n = 4.
[See Robson 1, p. 368.]

6. Shifting the origin to the cusp (b, 0), obtain the polar equation

$$ r = 2b(1 - \cos\theta) $$

for the cardioid (8.34 with n = 1). Deduce that chords through the cusp are of
constant length.

* Richard Blum, Canadian Mathematical Bulletin, 1 (1958), pp. 1-3.

7. A circle of radius b rolls without sliding on the inside of a fixed circle
of radius nb, where n > 1. Find parametric equations for the hypocycloid (when
n is integral, the n-cusped hypocycloid) which is the locus of a point fixed on
the circumference of the rolling circle. Sketch the cases n = 2 (which is
surprising), n = 3 (the deltoid), and n = 4 (the astroid). Eliminate the
parameter in the last two cases, obtaining, for the astroid,

$$ x^{2/3} + y^{2/3} = a^{2/3} \quad (a = 4b) $$

[Lamb 2, pp. 297-303].

Steiner discovered that all the Simson lines for any given triangle touch a
deltoid. Three of the lines, namely, those parallel to the sides of Morley’s
equilateral triangle (§ 1.9), are the “apsidal” tangents which the deltoid
shares with the nine-point circle. Their points of contact are the vertices of
the equilateral triangle XYZ described in Ex. 3 on page 20. For details, see
Baker [1, pp. 330-349, especially p. 347].


8.4 CONICS 


In addition to the straight lines, circles, planes and spheres with which 
every student of Euclid is familiar, the Greeks knew the properties of 
the curves given by cutting a cone with a plane—the ellipse, parabola 
and hyperbola. Kepler discovered by analysis of astronomical obser- 
vations, and Newton proved mathematically on the basis of the in- 
verse square law of gravitational attraction, that the planets move in 
ellipses. The geometry of ancient Greece thus became the corner- 
stone of modern astronomy. 


J. L. Synge (1897- )
[Synge 2, p. 32] 


Figure 8.4a 


There are several different ways to define a conic (or “conic section”).
One of the most straightforward is the following (cf. § 6.6): A conic is the
locus of a point P whose distance OP from a fixed point O is ε times its
distance PK from a fixed line HX (Figure 8.4a), where ε is a positive constant.

Other definitions for a conic, proposed by Menaechmus about 340 B.C.,
were reconciled with this one by Pappus of Alexandria (fourth century A.D.)
or possibly by Euclid [see Coolidge 1, pp. 9-13].

The conic is called an ellipse if $\varepsilon < 1$, a parabola if $\varepsilon = 1$, a hyperbola if
$\varepsilon > 1$. (These names are due to Apollonius.)

The point O and the line HX are called a focus and the corresponding
directrix. The number $\varepsilon$, called the eccentricity, is usually denoted by e (but
then, to avoid any possible misunderstanding, we should add “where e need
not be the base of the natural logarithms” [Littlewood 1, p. 43]). The chord
LL’ through the focus, parallel to the directrix, is called the latus rectum;
its length is denoted by 2l, so that


$l = OL = \varepsilon\,LH$.


In terms of polar coordinates with the initial line OX perpendicular to the
directrix, we have


$r = OP = \varepsilon\,PK = \varepsilon(LH - r\cos\theta)$
8.41    $\quad = l - \varepsilon r\cos\theta$,
so that
8.42    $\frac{l}{r} = 1 + \varepsilon\cos\theta$.


Since this equation is unchanged when we replace $\theta$ by $-\theta$, the conic is symmetrical
by reflection in the initial line. When $\theta = 0$, $r = l/(1 + \varepsilon)$; and
when $\theta = \pi$, $r = l/(1 - \varepsilon)$; therefore the conic meets the initial line twice
except when $\varepsilon = 1$.

If $\varepsilon < 1$, 8.42 makes r finite and positive for all values of $\theta$; therefore the
ellipse is a closed (oval) curve. If $\varepsilon = 1$, r is still finite and positive except
when $\theta = \pi$; therefore the parabola is not closed but extends to infinity in
one direction. If $\varepsilon > 1$, r is positive or negative according as $\cos\theta$ is greater
or less than $-1/\varepsilon$; therefore the hyperbola consists of two separate branches,
given by

$-\alpha < \theta < \alpha, \qquad \alpha < \theta < 2\pi - \alpha$,


respectively, where $\alpha = \operatorname{arcsec}(-\varepsilon)$.
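
The focus-directrix property behind 8.42 can be checked numerically. The following Python sketch is only an illustration: the function names are invented for the example, and it assumes the focus O at the origin with the directrix on the line x = l/ε (so that LH = l/ε, as above).

```python
import math

def conic_radius(l, eps, theta):
    """Polar radius from 8.42: l/r = 1 + eps*cos(theta)."""
    return l / (1.0 + eps * math.cos(theta))

def distance_ratio(l, eps, theta):
    """Return OP/PK for the point of the conic at polar angle theta.

    Assumption for this sketch: focus O at the origin, directrix on
    the line x = l/eps, so that LH = l/eps.
    """
    r = conic_radius(l, eps, theta)
    x = r * math.cos(theta)      # Cartesian abscissa of P
    op = abs(r)                  # r may be negative on a hyperbola
    pk = abs(l / eps - x)        # distance from P to the directrix
    return op / pk

# The ratio should reproduce the eccentricity in each case.
for eps in (0.5, 1.0, 2.0):
    print(eps, distance_ratio(1.0, eps, 1.0))
```

For ε > 1 and cos θ < −1/ε the computed r is negative, which corresponds to the second branch of the hyperbola.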
Squaring 8.41, we obtain the Cartesian equation


8.43    $x^2 + y^2 = (l - \varepsilon x)^2$
(which indicates that a circle may be regarded as an ellipse with eccentricity
zero). If $\varepsilon \ne 1$, we can divide by $1 - \varepsilon^2$ and then write a for $l/(1 - \varepsilon^2)$,
obtaining

$x^2 + \frac{y^2}{1 - \varepsilon^2} = la - 2\varepsilon a x$

or

$(x + \varepsilon a)^2 + \frac{y^2}{1 - \varepsilon^2} = (l + \varepsilon^2 a)a = a^2$

or

$\frac{(x + \varepsilon a)^2}{a^2} + \frac{y^2}{la} = 1$.


Translating the origin to $(-\varepsilon a, 0)$, we can reduce this to


8.44    $\frac{x^2}{a^2} \pm \frac{y^2}{b^2} = 1$
where $b^2 = |la| = |1 - \varepsilon^2|\,a^2$,


so that $la = \pm b^2$, with the upper or lower sign according as $\varepsilon < 1$ or $\varepsilon > 1$.
In the latter case the above definition makes a negative, but we can reverse
its sign without altering the equation 8.44. A more important remark is
that the equation is still unchanged when we reverse the sign of x or y. This
invariance shows that the ellipse and hyperbola are symmetrical by reflection
in either axis, and therefore also by the half-turn about the origin: their
symmetry group is $D_2$, in the notation of § 2.5. For this reason the origin
is called the center, and the ellipse and hyperbola are called central conics.


(Ellipse)    (Hyperbola)


Figure 8.4b 


The geometrical significance of a and b is indicated in Figure 8.4b. For
the ellipse, 2a and 2b are the major and minor axes; for the hyperbola, they
are the transverse and conjugate axes. The two branches of the hyperbola


8.441    $\frac{x^2}{a^2} - \frac{y^2}{b^2} = 1$


lie in two opposite angular regions formed by the two lines


$\frac{x^2}{a^2} - \frac{y^2}{b^2} = 0$ or $\left(\frac{x}{a} - \frac{y}{b}\right)\left(\frac{x}{a} + \frac{y}{b}\right) = 0$.
These lines are called the asymptotes of the hyperbola. If a = b, they are
perpendicular, and we have a rectangular (or “equilateral”) hyperbola.
If $\varepsilon = 1$, then 8.43 reduces to


$y^2 = 2l(\tfrac{1}{2}l - x)$
or, by reflection in the line $x = \tfrac{1}{4}l$,

8.45    $y^2 = 2lx$.

This is the standard equation for the parabola (Figure 8.4c).


Figure 8.4c

The most convenient parametric equations are: for the ellipse
8.46    $x = a\cos t, \quad y = b\sin t$,


for the parabola


8.47    $x = 2lt^2, \quad y = 2lt$,
and for the hyperbola
8.48    $x = a\cosh t, \quad y = b\sinh t$
where
$\cosh t = \frac{e^t + e^{-t}}{2}, \quad \sinh t = \frac{e^t - e^{-t}}{2}$.


(These functions will be discussed in § 8.6.) 
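
As a quick sanity check, one can substitute the parametric equations 8.46, 8.47, 8.48 into the Cartesian equations 8.44 and 8.45. The Python sketch below does this numerically; the function names are invented for the illustration, and each function returns a residual that should vanish up to rounding error.

```python
import math

def ellipse_residual(a, b, t):
    """Residual of x^2/a^2 + y^2/b^2 = 1 at the point 8.46."""
    x, y = a * math.cos(t), b * math.sin(t)
    return x**2 / a**2 + y**2 / b**2 - 1.0

def parabola_residual(l, t):
    """Residual of y^2 = 2lx (8.45) at the point 8.47."""
    x, y = 2 * l * t**2, 2 * l * t
    return y**2 - 2 * l * x

def hyperbola_residual(a, b, t):
    """Residual of x^2/a^2 - y^2/b^2 = 1 at the point 8.48."""
    x, y = a * math.cosh(t), b * math.sinh(t)
    return x**2 / a**2 - y**2 / b**2 - 1.0

for t in (-1.5, 0.0, 0.7, 2.0):
    print(t, ellipse_residual(3, 2, t), parabola_residual(1.5, t),
          hyperbola_residual(3, 2, t))
```

Note that 8.48 with real t yields only the branch with x > 0, a point taken up in Ex. 9 below.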


EXERCISES 
1. What kind of curve has the polar equation
$r = \tfrac{1}{2}l\sec^2\tfrac{1}{2}\theta$?
2. What kind of curve has the Cartesian equation
$4x^2 + 24xy + 11y^2 = 5$?


(See Ex. 5 at the end of §8.2.) 
3. The sum (or difference) of the distances of a point on an ellipse (or hyperbola)
from the two foci is constant.
4. Express the eccentricity of a central conic in terms of its semiaxes a and b. What
is the eccentricity of a rectangular hyperbola?

5. Given points B and C, the locus of the vertex A of a triangle ABC whose Euler 
line is parallel to BC (as in Ex. 9 at the end of §1.6) is an ellipse whose minor axis is 
BC while its major axis is twice the altitude of the equilateral triangle on BC. (Hint: 
If A, B, C are (x, y) and $(\mp 1, 0)$, the circumcenter, equidistant from A and C, is $(0, \tfrac{1}{3}y)$.)

6. An expression such as 


$F = ax^2 + 2hxy + by^2$


is called a binary quadratic form. It is said to be definite if $ab > h^2$, so that F has the
same sign for all values of x and y except x = y = 0. It is said to be positive definite
if this sign is positive. It is said to be semidefinite if $ab = h^2$, so that F is a times a
perfect square; positive semidefinite if a > 0, so that F itself is a perfect square; indefinite
if $ab < h^2$, so that F is positive for some values of x and y, negative for others.
The equation F = 1 represents an ellipse if F is positive definite, a pair of parallel lines
if F is positive semidefinite, and a hyperbola if F is indefinite.

7. What happens to the equation for the rectangular hyperbola $x^2 - y^2 = a^2$ when
we rotate the axes through the angle $\tfrac{1}{4}\pi$?


8. Describe a geometrical interpretation for the parameter t in 8.46. [Hint: Compare
$(a\cos t, b\sin t)$ with $(a\cos t, a\sin t)$.]

9. In what respect is the hyperbola 8.441 more satisfactorily represented by the
equations

$x = a\sec u, \quad y = b\tan u$

than by the equations 8.48? 

10. The circle r = l inverts the conic 8.42 into the limaçon

$r = l(1 + \varepsilon\cos\theta)$.

Sketch this curve for various values of $\varepsilon$. When $\varepsilon = 1$ (so that the conic is a parabola),
it is a cardioid.

11. The circle r = a inverts the rectangular hyperbola $r^2 = a^2\sec 2\theta$ into the lemniscate
of Bernoulli

$r^2 = a^2\cos 2\theta$


(see Ex. 3 at the end of §8.3). 


8.5 TANGENT, ARC LENGTH, AND AREA 


I do not know what I may appear to the world; but to myself I seem
to have been only like a boy playing on the seashore, and diverting 
myself in now and then finding a smoother pebble or a prettier shell 
than ordinary, whilst the great ocean of truth lay all undiscovered be- 
fore me. 


Sir Isaac Newton 


(Brewster's Memoirs of Newton, vol. 2, Chap. 27) 


The curves with which we shall be concerned are “rectifiable,” that is, there
is a well-defined arc length s between any two points P and Q on such a
curve. Using the temporary notation $P_0 = P$, $P_n = Q$, we subdivide the
given arc PQ by $n - 1$ points $P_1, P_2, \ldots, P_{n-1}$ and consider the least upper
bound


$s = \sup \sum_{i=1}^{n} P_{i-1}P_i$
of the lengths of the broken lines


$PP_1 + P_1P_2 + \cdots + P_{n-1}Q$,


for all possible subdivisions.
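
The least upper bound can be watched at work on a familiar curve. The sketch below (in Python; the helper names are invented for the illustration) computes the lengths of inscribed broken lines on the unit circle. It uses only evenly spaced subdivisions, which is a simplification: the definition above takes the supremum over all subdivisions, but for the circle the even ones already increase toward 2π.

```python
import math

def broken_line_length(curve, t0, t1, n):
    """Length of the broken line through n + 1 evenly spaced points
    curve(t0), ..., curve(t1) on a parametrized plane curve."""
    pts = [curve(t0 + (t1 - t0) * i / n) for i in range(n + 1)]
    return sum(math.dist(p, q) for p, q in zip(pts, pts[1:]))

def unit_circle(t):
    return (math.cos(t), math.sin(t))

# The lengths increase with n and stay below the circumference 2*pi.
for n in (4, 16, 64, 256):
    print(n, broken_line_length(unit_circle, 0.0, 2 * math.pi, n))
```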


Figure 8.5a 


The curve is often usefully regarded as the locus of a “moving” point. 
Any two points P and P’ on the curve are joined by a line called a secant. 
If P is fixed while P’ moves, the secant usually approaches a limiting position
which is called the tangent at P. When we use rectangular Cartesian
coordinates, we draw P’M parallel to the y-axis, as in Figure 8.5a, and let
N be the foot of the perpendicular from P to P’M. The tangent has an
angle of slope, $\psi$, which may be defined as the limit of $\angle NPP'$. (The figure
can be modified in an obvious manner when this angle is obtuse or negative.)
To compute $\psi$, we consider the right-angled triangle PP’N whose sides
are the “increments” of x, y, s (all tending to zero):


$\Delta x = PN, \quad \Delta y = NP', \quad \Delta s = PP'$.


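Thus

$\cos\psi = \lim \frac{PN}{PP'} = \lim \frac{\Delta x}{\Delta s} = \frac{dx}{ds}$,

$\sin\psi = \lim \frac{NP'}{PP'} = \lim \frac{\Delta y}{\Delta s} = \frac{dy}{ds}$,

$\tan\psi = \lim \frac{NP'}{PN} = \lim \frac{\Delta y}{\Delta x} = \frac{dy}{dx}$.

Since $PP'^2 = PN^2 + NP'^2$ (or since $\cos^2\psi + \sin^2\psi = 1$), the element of
arc length ds is given by


8.51    $ds^2 = dx^2 + dy^2$,


and the arc length s, from $(x_1, y_1)$ to $(x_2, y_2)$, or from $t = t_1$ to $t = t_2$, is

$s = \int_{x_1}^{x_2} \sqrt{1 + \left(\frac{dy}{dx}\right)^2}\,dx = \int_{t_1}^{t_2} \sqrt{\left(\frac{dx}{dt}\right)^2 + \left(\frac{dy}{dt}\right)^2}\,dt$.


Figure 8.5b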


When a curve is given by its polar equation, the direction of the tangent
at P is determined either by the angle $\phi$ which this tangent makes with the
“radius” OP or (equally well) by the angle


$\psi = \theta + \phi$


which it makes with the initial line OX (Figure 8.5b). As before, let P and
P’ be two neighboring points on the curve, so that the tangent at P is the
limiting position of the secant PP’. Draw PN perpendicular to OP’ [Lamb
2, p. 254]. Then


8.52    $\cos\phi = \lim \frac{NP'}{PP'} = \lim \frac{\Delta r}{\Delta s} = \frac{dr}{ds}$,

$\sin\phi = \lim \frac{NP}{PP'} = \lim \frac{r\,\Delta\theta}{\Delta s} = r\frac{d\theta}{ds}$,

8.53    $\tan\phi = \lim \frac{NP}{NP'} = \lim \frac{r\,\Delta\theta}{\Delta r} = r\frac{d\theta}{dr}$.


Since $PP'^2 = NP'^2 + NP^2$ (or since $\cos^2\phi + \sin^2\phi = 1$), the element
of arc length ds is now given by


8.54    $ds^2 = dr^2 + r^2\,d\theta^2$.
Since the area of the thin triangle OPP’ differs by a second-order infinitesimal
from that of a circular sector of radius r and angle $\Delta\theta$, which is
$\frac{1}{2}r^2\,\Delta\theta$, the area of any closed curve surrounding the origin just once is


8.55    $\frac{1}{2}\int_0^{2\pi} r^2\,d\theta$.
0 


Such a formula can be translated into terms of Cartesian coordinates by
means of the relations 8.21, which imply


8.56    $dx = dr\cos\theta - r\sin\theta\,d\theta, \quad dy = dr\sin\theta + r\cos\theta\,d\theta$,
so that $x\,dy - y\,dx = r\cos\theta\,dy - r\sin\theta\,dx = r^2(\cos^2\theta + \sin^2\theta)\,d\theta = r^2\,d\theta$


and


8.57    $\frac{1}{2}\int r^2\,d\theta = \frac{1}{2}\int (x\,dy - y\,dx)$.


This must, of course, be interpreted as


8.58    $\frac{1}{2}\int \left(x\frac{dy}{dt} - y\frac{dx}{dt}\right)dt$,


where x and y are given in terms of a parameter t, and the integration is
over the values of t that take us all round the curve.
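
Formula 8.58 lends itself to direct numerical evaluation. The Python sketch below applies a midpoint rule to the astroid x = cos³ t, y = sin³ t (the case a = 1 of the astroid mentioned earlier), whose area is known to be 3π/8. The function name and the choice of quadrature rule are assumptions made for this illustration only.

```python
import math

def enclosed_area(x, y, dx, dy, t0, t1, n=2000):
    """Midpoint-rule approximation of 8.58:
    (1/2) * integral of (x dy/dt - y dx/dt) dt from t0 to t1."""
    h = (t1 - t0) / n
    total = 0.0
    for i in range(n):
        t = t0 + (i + 0.5) * h
        total += (x(t) * dy(t) - y(t) * dx(t)) * h
    return 0.5 * total

# Astroid with a = 1 and its derivatives with respect to t.
x = lambda t: math.cos(t) ** 3
y = lambda t: math.sin(t) ** 3
dx = lambda t: -3 * math.cos(t) ** 2 * math.sin(t)
dy = lambda t: 3 * math.sin(t) ** 2 * math.cos(t)

area = enclosed_area(x, y, dx, dy, 0.0, 2 * math.pi)
print(area, 3 * math.pi / 8)
```

Here the integrand reduces to 3 sin² t cos² t, so the quadrature is essentially exact; for less regular curves a finer subdivision may be needed.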


Figure 8.5c    Figure 8.5d


The same formula can be used to compute the area of the “sector” obtained
by joining the origin to a given arc (Figure 8.5c or d). In polar coordinates,
if the arc goes from $\theta = \theta_1$ to $\theta = \theta_2$, the area is


$\frac{1}{2}\int_{\theta_1}^{\theta_2} r^2\,d\theta$.


When we transfer this to Cartesian coordinates, we regard the boundary of
the sector as a closed “curve” consisting of the arc and the two radii. Since


$\frac{x\,dy - y\,dx}{x^2} = d\left(\frac{y}{x}\right)$,


the radii (along which y/x remains constant) make a zero contribution to
the integral on the right side of 8.57. Hence, if the arc goes from $t = t_1$ to
$t = t_2$, the area of the sector is


8.59    $\frac{1}{2}\int_{t_1}^{t_2} \left(x\frac{dy}{dt} - y\frac{dx}{dt}\right)dt$


[cf. Courant 1, p. 273].


EXERCISES 


1. The line $x - (t + t')y + 2ltt' = 0$ is a secant of the parabola 8.47, meeting it
in the points whose parameters are t and t’. Making t’ tend to t, deduce the equation


$x - 2ty + 2lt^2 = 0$


for the tangent at the point whose parameter is t, so that, if $(x_1, y_1)$ lies on the parabola
8.45, the tangent at this point is


$y_1 y = l(x + x_1)$.
2. The line


$\frac{x}{a}\cos\alpha + \frac{y}{b}\sin\alpha = \cos\beta$


is a secant of the ellipse 8.46, meeting it in the points $t = \alpha \pm \beta$. Making $\beta$ tend to
0, deduce the equation


$\frac{x}{a}\cos t + \frac{y}{b}\sin t = 1$

for the tangent at the point whose parameter is t [Robson 1, p. 274]. Obtain analogous
results for the hyperbola 8.48. Deduce that, if $(x_1, y_1)$ lies on the central conic
8.44, the tangent at this point is


$\frac{x_1 x}{a^2} \pm \frac{y_1 y}{b^2} = 1$.
3. At the point t on the ellipse 8.46, the normal, being perpendicular to the tangent, is


$\frac{ax}{\cos t} - \frac{by}{\sin t} = a^2 - b^2$.


Differentiating partially with respect to t and then eliminating t, obtain the envelope
of normals in the form

$\left(\frac{ax}{a^2 - b^2}\right)^{2/3} + \left(\frac{by}{a^2 - b^2}\right)^{2/3} = 1$


[Forder 3, pp. 36-37; Lamb 2, p. 350]. Hint:


$\cos^3 t = \frac{ax}{a^2 - b^2}, \quad \sin^3 t = -\frac{by}{a^2 - b^2}$.
4. Use 8.56 to reconcile 8.51 and 8.54.


8.6 HYPERBOLIC FUNCTIONS


The hyperbolic sine and cosine have a property in reference to 
the rectangular hyperbola, exactly analogous to that of the sine 
and cosine with reference to the circle. For this reason the former 
functions are called hyperbolic functions, just as the latter are 
called circular functions. 

E. W. Hobson (1856 -1933) 


[Hobson 1, pp. 329 -330] 


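As a very simple application of the formula 8.59, consider the unit circle
$x^2 + y^2 = 1$ or


$x = \cos t, \quad y = \sin t$
(Figure 8.5c). Since
$\frac{dx}{dt} = -\sin t = -y$ and $\frac{dy}{dt} = \cos t = x$,


the area of the sector from t = 0 to any other value is


$\frac{1}{2}\int_0^t \left(x\frac{dy}{dt} - y\frac{dx}{dt}\right)dt = \frac{1}{2}\int_0^t (x^2 + y^2)\,dt$


$= \frac{1}{2}\int_0^t dt = \frac{1}{2}t$,


which, of course, we knew already. More interestingly (Figure 8.5d), if the
curve is the rectangular hyperbola $x^2 - y^2 = 1$ or


$x = \cosh t, \quad y = \sinh t$,
so that


$\frac{dx}{dt} = \sinh t = y$ and $\frac{dy}{dt} = \cosh t = x$,


the area of the sector is again


$\frac{1}{2}\int_0^t \left(x\frac{dy}{dt} - y\frac{dx}{dt}\right)dt = \frac{1}{2}\int_0^t (x^2 - y^2)\,dt$


$= \frac{1}{2}\int_0^t dt = \frac{1}{2}t$.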


Comparing the above results, we see clearly the analogy that relates the
circular and hyperbolic functions. In Figures 8.5c and d, we have a sector
AOP of the circle or rectangular hyperbola, respectively. In both cases OA
= 1 and the parameter t is twice the area of the sector. In the former,
OM = cos t and PM = sin t. In the latter, OM = cosh t and PM = sinh t.


EXERCISES
1. Find the area of the ellipse


$x = a\cos t, \quad y = b\sin t$.
2. Find the area of the sector of the general hyperbola


$x = a\cosh t, \quad y = b\sinh t$.


8.7. THE EQUIANGULAR SPIRAL 


In the equiangular spiral of the Nautilus or the snail-shell or Globige- 
rina, the whorls continually increase in breadth, and do so in a steady 
and unchanging ratio. .. . It follows that the sectors cut off by succes- 
sive radii, at equal vectorial angles, are similar to one another in every 
respect; and that the figure may be conceived as growing continuously 
without ever changing its shape the while. 


Sir D’Arcy W. Thompson (1860 -1948) 
[Thompson 2, pp. 753-754 ] 


The circle r = a may be regarded as the locus of the transform of the point
(a, 0) by a continuous rotation, which transforms each point $(r, \theta)$ into
$(r, \theta + t)$ where t varies continuously. Similarly, the ray (or half line) $\theta = 0$
is the locus of the transform of (a, 0) by a continuous dilatation, which yields
(r, 0) for all positive values of r. By judiciously combining these two transformations,
we obtain a continuous dilative rotation. Let $\mu$ denote the ratio of
magnification corresponding to rotation through 1 radian. Then $\mu^2$ is the
ratio of magnification for 2 radians, $\mu^3$ for 3 radians, ... , $\mu^\pi$ for $\pi$ radians,
... , $\mu^t$ for t radians. Thus the dilative rotation transforms the general point
$(r, \theta)$ into $(\mu^t r, \theta + t)$, where t varies continuously. The locus of the transform
of (a, 0) is the equiangular spiral (or “logarithmic spiral”), whose parametric
equations

$r = \mu^t a, \quad \theta = t$


may be combined into the single polar equation
8.71    $r = a\mu^\theta$.


This curve was first recognized by Descartes, who discussed it in 1638 in his letters to Mersenne. 
Jacob Bernoulli (1654-1705) found it so fascinating that he arranged to have it engraved on his 
tombstone (in the Münster at Basel, Switzerland) with the inscription

Eadem mutata resurgo. 

These words (which E. T. Bell translates as “Though changed I shall arise the same”) express a 
remarkable consequence of the way the curve can be shifted along itself by a dilative rotation: 
any dilatation has the same effect on it as a rotation, and vice versa. In fact, the rotation $\theta \to \theta + \alpha$,
changing $r = a\mu^\theta$ into $a\mu^{\theta+\alpha} = \mu^\alpha r$, is equivalent to the dilatation $O(\mu^\alpha)$. Steinhaus [1,
p. 97] describes this property as an optical illusion. Having drawn an equiangular spiral (Figure
8.7a), he remarks: “If we turn it round (in this case together with the book), it seems to grow
larger or smaller.”


Figure 8.7a


Since

$\frac{dr}{d\theta} = r\log\mu$,

8.53 shows that the angle $\phi$, between the position vector OP and the tangent
at P, is given by

$\cot\phi = \log\mu$;

that is, this angle is constant: a result which could have been foreseen from
the fact that similarities preserve angles. In terms of this constant angle $\phi$,
which is about 80° in Figure 8.7a, we can write
$\mu^\theta = e^{\theta\log\mu} = e^{\theta\cot\phi}$,
thus expressing the spiral in its classical form
8.72    $r = ae^{\theta\cot\phi}$    (a and $\phi$ constant).
By 8.52,


$\frac{dr}{ds} = \cos\phi$,


so that $r - s\cos\phi$ is constant. This shows that the length of the arc from
$r = r_1$ to $r = r_2$ is
$(r_2 - r_1)\sec\phi$,


and that the length from the origin (r = 0) to the general point (although
this involves infinitely many turns) is


$r\sec\phi$.
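
The arc-length formula $(r_2 - r_1)\sec\phi$ can be compared with a polygonal approximation of the spiral 8.72. The Python sketch below is an illustration under stated assumptions: a = 1, $\phi$ = 80° (roughly the angle of Figure 8.7a), and one full turn from $\theta = 0$ to $\theta = 2\pi$; the function names are invented for the example.

```python
import math

def spiral_point(a, phi, theta):
    """Cartesian point of the spiral 8.72: r = a*exp(theta*cot(phi))."""
    r = a * math.exp(theta / math.tan(phi))
    return (r * math.cos(theta), r * math.sin(theta))

def spiral_arc_length(a, phi, theta1, theta2, n=20000):
    """Length of an inscribed broken line with n equal steps in theta."""
    pts = [spiral_point(a, phi, theta1 + (theta2 - theta1) * i / n)
           for i in range(n + 1)]
    return sum(math.dist(p, q) for p, q in zip(pts, pts[1:]))

a, phi = 1.0, math.radians(80.0)
r1 = a                                            # radius at theta = 0
r2 = a * math.exp(2 * math.pi / math.tan(phi))    # radius at theta = 2*pi
numeric = spiral_arc_length(a, phi, 0.0, 2 * math.pi)
closed_form = (r2 - r1) / math.cos(phi)           # (r2 - r1) sec(phi)
print(numeric, closed_form)
```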


EXERCISES 


1. The spiral $r = a\mu^\theta$ is homothetic to itself by means of the dilatation $O(\mu^{2\pi})$.
How is it affected by inversion in the circle r = a?
2. Sketch the inverse of an equiangular spiral with respect to a circle whose center
is on
(i) the spiral itself,
(ii) the image of the spiral by the half-turn about its pole.
(Inverses of equiangular spirals are called loxodromes.)


8.8 THREE DIMENSIONS 


To Kastner, the Analyst, it appeared that the application of algebra to 
geometry would free the student from the stern discipline of Euclid. It 
would enable him to think for himself instead of having to watch the 
lips of his teacher. One of Kastner's sentences could fitly be inscribed 
over the door of any modern school: “The greatest satisfaction that we
know is to find the truth for ourselves.”


J. L. Synge [2, p.174] 


To set up a system of Cartesian coordinates in space we use three axial 
planes meeting by pairs in three axes OX, OY, OZ. Starting from the origin 
O, we reach the general point (x, y, z) by going a distance x along the x-axis 
OX, then a distance y in the direction of the y-axis OY, and finally a distance 
z in the direction of the z-axis OZ. The three axial planes have the equations 
x = 0,y = 0,z = 0, which can be taken in pairs to determine the axes. For 
instance, the z-axis, consisting of all the points (0, 0, z), has the two equations
x = y = 0. Any line through the origin (0, 0, 0) has parametric
equations


8.81    $x = Xt, \quad y = Yt, \quad z = Zt$.
The mutual ratios of the coefficients X, Y, Z determine the direction of the
line. By a translation to a new origin, we see that the parallel line through
$(x_1, y_1, z_1)$ is
8.82    $x = x_1 + Xt, \quad y = y_1 + Yt, \quad z = z_1 + Zt$.
Eliminating t, we obtain the two equations

8.83    $\frac{x - x_1}{X} = \frac{y - y_1}{Y} = \frac{z - z_1}{Z}$,


which have to be interpreted by a special convention when XYZ = 0. The
centroid of masses $t_1$ at $(x_1, y_1, z_1)$ and $t_2$ at $(x_2, y_2, z_2)$, with $t_1 + t_2 = 1$,
is


$(t_1x_1 + t_2x_2,\ t_1y_1 + t_2y_2,\ t_1z_1 + t_2z_2)$.


The origin and $(x_1, y_1, z_1)$ are opposite vertices of the parallelepiped formed
by the three pairs of parallel planes


Figure 8.8a


$x = 0,\ x = x_1; \quad y = 0,\ y = y_1; \quad z = 0,\ z = z_1;$


as in Figure 8.8a. (In the tongue-twisting word “parallelepiped” we stress
the syllable “ep,” belonging to the Greek prefix epi, which occurs also in
such words as “epithet” and “epicycloid.”) For the rest of the present section
we shall take the axes to be mutually orthogonal, so that this is a rectangular
parallelepiped (or “box”). The three-dimensional extension of
Pythagoras’s theorem shows that the length of the diagonal


$(0, 0, 0)(x_1, y_1, z_1)$
is the square root of $x_1^2 + y_1^2 + z_1^2$. Similarly, the distance between
(x, y, z) and (x’, y’, z’) is the square root of
$(x - x')^2 + (y - y')^2 + (z - z')^2$.
If the parameter t in 8.81 is adjusted so that
8.84    $X^2 + Y^2 + Z^2 = 1$,


it measures the distance from the origin to the general point (x, y, z) on the 
line. The coefficients X, Y, Z, satisfying 8.84, are called the direction cosines 
of the line, because they are the cosines of the angles which the line makes 
with the coordinate axes. More precisely, they are the direction cosines of 
one of the two rays into which the line is decomposed by the origin; the opposite
ray has the direction cosines $-X, -Y, -Z$. Two rays forming an
angle meet the unit sphere


$x^2 + y^2 + z^2 = 1$


in two points, say (X, Y, Z), (X’, Y’, Z’), whose coordinates are equal to the
direction cosines of the rays. By “solving” the isosceles triangle which
these points form with the origin, we obtain the expression


$XX' + YY' + ZZ'$


for the cosine of the angle between the rays. In particular, the rays (and
therefore also the lines) are at right angles if


$XX' + YY' + ZZ' = 0$.
It follows that the plane through the origin perpendicular to the line 8.81 is
$Xx + Yy + Zz = 0$.
By a translation, we deduce the parallel plane through $(x_1, y_1, z_1)$:


8.85    $Xx + Yy + Zz = T$,
where $T = Xx_1 + Yy_1 + Zz_1$. Still assuming
$X^2 + Y^2 + Z^2 = X'^2 + Y'^2 + Z'^2 = 1$,
we deduce that the two supplementary angles between the planes
$Xx + Yy + Zz = T, \quad X'x + Y'y + Z'z = T'$
are the angles whose cosines are
$\pm(XX' + YY' + ZZ')$.
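
The expression $XX' + YY' + ZZ'$ translates directly into a computation. In the Python sketch below (function names invented for the illustration), arbitrary direction numbers are first normalized so that 8.84 holds, and the angle between the two rays is then recovered from the cosine; the clamping to [−1, 1] is merely a guard against rounding error.

```python
import math

def direction_cosines(v):
    """Scale direction numbers (X, Y, Z) so that X^2 + Y^2 + Z^2 = 1."""
    norm = math.sqrt(sum(c * c for c in v))
    return tuple(c / norm for c in v)

def angle_between_rays(u, v):
    """Angle whose cosine is XX' + YY' + ZZ' for the two rays."""
    p, q = direction_cosines(u), direction_cosines(v)
    cosine = sum(a * b for a, b in zip(p, q))
    return math.acos(max(-1.0, min(1.0, cosine)))

print(math.degrees(angle_between_rays((1, 0, 0), (0, 1, 0))))
print(math.degrees(angle_between_rays((1, 1, 0), (1, 0, 0))))
```

The same function gives one of the two supplementary angles between the planes 8.85 when it is applied to their coefficient triples.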


We see now that every plane has a linear equation, and every linear equa- 
tion determines a plane. In particular, the plane that makes intercepts p, g, r 
on the axes is 


$\frac{x}{p} + \frac{y}{q} + \frac{z}{r} = 1$.

Two planes of the form 8.85 are parallel if they differ only in their “constant” 
terms T. The line of intersection of two nonparallel planes can be reduced 
to the standard form 8.83 by eliminating first z and then x. 

An equation connecting x, y, z (not necessarily linear) usually represents a 

surface; two such equations together represent a curve, the intersection of 
two surfaces. In particular, an equation 


F(x, y) = 0, 


involving only x and y, represents a cylinder, the locus of a line that passes
through a variable point on the curve F(x, y) = z = 0 while remaining parallel
to the z-axis. A homogeneous equation


$f(x, y, z) = 0$
(whose left side is merely multiplied by a power of $\mu$ when x, y, z are replaced
by $\mu x$, $\mu y$, $\mu z$) represents a cone, the locus of a line that joins the origin to a
variable point on the curve


$f(x, y, 1) = 0, \quad z = 1$.
Important instances are the quadric cylinders


$\frac{x^2}{a^2} \pm \frac{y^2}{b^2} = 1$ and $y^2 = 2lx$,
including the ordinary cylinder of revolution (or “right circular cylinder”)
$x^2 + y^2 = k^2$, and the quadric cones


$ax^2 + by^2 + cz^2 = 0$,


including the cone of revolution (or “right circular cone”) $x^2 + y^2 = cz^2$.
The equation $x^2 + y^2 + z^2 = 0$, which is satisfied only by (0, 0, 0), may
be regarded either as a peculiar kind of cone or as a sphere of radius zero.

The general sphere, having center (x’, y’, z’) and radius k, is, of course,


$(x - x')^2 + (y - y')^2 + (z - z')^2 = k^2$.
We observe that this is an equation of the second degree in which the coefficients
of $x^2$, $y^2$, $z^2$ are all equal while there are no terms in yz, zx, xy.


The sphere $x^2 + y^2 + z^2 = k^2$, whose center is the origin, inverts the point
(X, Y, Z) into


$\left(\frac{k^2X}{X^2 + Y^2 + Z^2},\ \frac{k^2Y}{X^2 + Y^2 + Z^2},\ \frac{k^2Z}{X^2 + Y^2 + Z^2}\right)$.
The plane through this inverse point, perpendicular to the line 8.81, namely,


$Xx + Yy + Zz = k^2$,


is called the polar plane of (X, Y, Z) with respect to the sphere. If (X, Y, Z)
lies on the sphere, the polar plane is simply the tangent plane.
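
A short computation confirms that the inverse point lies on the polar plane. The Python sketch below uses invented function names and assumes only that the point (X, Y, Z) is not the origin.

```python
def invert_in_sphere(p, k):
    """Inverse of p = (X, Y, Z) in the sphere x^2 + y^2 + z^2 = k^2."""
    s = sum(c * c for c in p)          # X^2 + Y^2 + Z^2, assumed nonzero
    return tuple(k * k * c / s for c in p)

def polar_plane_value(p, q):
    """Left side Xx + Yy + Zz of the polar plane of p, evaluated at q."""
    return sum(a * b for a, b in zip(p, q))

p, k = (1.0, 2.0, 2.0), 2.0
q = invert_in_sphere(p, k)
print(q, polar_plane_value(p, q), k * k)   # the last two values agree
```

Applying `invert_in_sphere` twice returns the original point, as one expects of an inversion.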


The three-dimensional analogues of the conics are the quadric surfaces or
quadrics, whose plane sections are conics (or occasionally pairs of lines,
which may be regarded as degenerate conics). These surfaces, whose equations
are of the second degree, include not only the elliptic and hyperbolic
cylinders, the quadric cone, and the sphere, but also the ellipsoid

$\frac{x^2}{a^2} + \frac{y^2}{b^2} + \frac{z^2}{c^2} = 1$,


the hyperboloid of one sheet


8.86    $\frac{x^2}{a^2} + \frac{y^2}{b^2} - \frac{z^2}{c^2} = 1$


[R. J. T. Bell 1, p. 149, Fig. 41], the hyperboloid of two sheets


$\frac{x^2}{a^2} - \frac{y^2}{b^2} - \frac{z^2}{c^2} = 1$


[Salmon 2, p. 80, Fig. 1-4], the elliptic paraboloid


$\frac{x^2}{a^2} + \frac{y^2}{b^2} = 2z$,


and the hyperbolic paraboloid
8.861    $\frac{x^2}{a^2} - \frac{y^2}{b^2} = 2z$
[R. J. T. Bell 1, p. 150, Fig. 42]. The nature of these surfaces can be roughly
discerned by considering their sections by planes parallel to the coordinate
planes. Their names were invented by G. Monge in 1805 [see Blaschke 1,
p. 131].

Important special cases are the quadrics of revolution, formed by revolving 
a conic about one of its axes. For instance, the special ellipsoid obtained by 
revolving an ellipse about its major or minor axis is a prolate spheroid or an 
oblate spheroid, respectively. 

For the investigation of surfaces of revolution it is often convenient to use 
cylindrical coordinates (r, @, z), in which the first two of the three Cartesian 
coordinates are replaced by polar coordinates 


$r = \sqrt{x^2 + y^2}, \quad \theta = \arctan\frac{y}{x}$


while z retains its usual meaning. To revolve a plane curve
$F(x, z) = 0, \quad y = 0$
about the z-axis, we simply replace x by r; thus the surface of revolution is
$F(r, z) = 0$,


or, in Cartesian coordinates, $F(\sqrt{x^2 + y^2}, z) = 0$.
For instance, revolving the hyperbola 8.441 about its conjugate axis, we
obtain the hyperboloid of revolution (of one sheet)


$\frac{r^2}{a^2} - \frac{z^2}{b^2} = 1$ or $\frac{x^2 + y^2}{a^2} - \frac{z^2}{b^2} = 1$.


Replacing $x^2 + y^2$ by $(x\cos\alpha + y\sin\alpha)^2 + (y\cos\alpha - x\sin\alpha)^2$, we may
express this equation in the form


$(x\cos\alpha + y\sin\alpha)^2 - (az/b)^2 = -(y\cos\alpha - x\sin\alpha)^2 + a^2$,
which shows that, for each value of $\alpha$, every point on the line
$x\cos\alpha + y\sin\alpha = az/b, \quad y\cos\alpha - x\sin\alpha = a$


lies on the hyperboloid. Allowing $\alpha$ to vary from 0 to $2\pi$, we obtain a continuous
system of generators: lines lying entirely on the surface. Reflecting
in the (x, y)-plane by reversing the sign of z, we obtain a second system of
generators on the same hyperboloid. The plane


8.87    $x\cos\alpha + y\sin\alpha = az/b$,


through the center, touches the asymptotic cone

$\frac{x^2 + y^2}{a^2} - \frac{z^2}{b^2} = 0$

along the line


$\frac{x}{a\cos\alpha} = \frac{y}{a\sin\alpha} = \frac{z}{b}$,


and meets the hyperboloid in two parallel lines: one in each of the planes
$y\cos\alpha - x\sin\alpha = \pm a$.
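
That the generators really lie on the hyperboloid of revolution is easy to test numerically. In the Python sketch below (function names invented for the illustration), the line $x\cos\alpha + y\sin\alpha = az/b$, $y\cos\alpha - x\sin\alpha = a$ is parametrized by z, which is a choice made here for convenience, and each point is substituted into the equation of the surface.

```python
import math

def generator_point(a, b, alpha, z):
    """Point with height z on the generator
    x cos(alpha) + y sin(alpha) = a z / b,  y cos(alpha) - x sin(alpha) = a."""
    p = a * z / b    # prescribed value of x cos(alpha) + y sin(alpha)
    q = a            # prescribed value of y cos(alpha) - x sin(alpha)
    x = p * math.cos(alpha) - q * math.sin(alpha)
    y = p * math.sin(alpha) + q * math.cos(alpha)
    return (x, y, z)

def hyperboloid_residual(a, b, point):
    """Residual of (x^2 + y^2)/a^2 - z^2/b^2 = 1 at the given point."""
    x, y, z = point
    return (x * x + y * y) / a**2 - z * z / b**2 - 1.0

for z in (-3.0, 0.0, 2.5):
    pt = generator_point(2.0, 1.5, 0.7, z)
    print(pt, hyperboloid_residual(2.0, 1.5, pt))
```

Replacing z by −z in the returned point gives a point of the second system of generators.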
Another interesting surface of revolution is the ring-shaped torus


$(r - a)^2 + z^2 = b^2$    (a > b),


which is obtained by revolving a circle of radius b about an exterior line in
its plane, distant a from the center. This surface evidently contains two
systems of circles: the “meridians,” of radius b, and the “parallels” (in planes
parallel to z = 0), whose radii vary between a − b and a + b. It is less
obvious that the torus contains also two “oblique” systems of circles of radius
a, such that two circles of opposite systems meet twice while two distinct
circles of the same system do not meet at all but are interlocked.* In fact,
by expressing the equation


8.88    $(\sqrt{x^2 + y^2} - a)^2 + z^2 = b^2$
in the form
$(x^2 + y^2 + z^2 - a^2 + b^2)^2 + 4(a^2 - b^2)z^2 = 4b^2(x^2 + y^2)$


$= 4b^2\{(x\cos\alpha + y\sin\alpha)^2 + (y\cos\alpha - x\sin\alpha)^2\}$
or
$(x^2 + y^2 + z^2 - a^2 + b^2)^2 - 4b^2(y\cos\alpha - x\sin\alpha)^2$


$= 4b^2(x\cos\alpha + y\sin\alpha)^2 - 4(a^2 - b^2)z^2$,


we see that, for each value of $\alpha$, the torus contains the whole of the section
of the sphere


$x^2 + y^2 + z^2 - a^2 + b^2 = 2b(y\cos\alpha - x\sin\alpha)$
by the plane


8.89    $b(x\cos\alpha + y\sin\alpha) = \sqrt{a^2 - b^2}\,z$.
Since the sphere can be expressed as
$(x + b\sin\alpha)^2 + (y - b\cos\alpha)^2 + z^2 = a^2$


and the plane passes through its center $(-b\sin\alpha, b\cos\alpha, 0)$, the section is
a great circle, and its radius is a. Allowing $\alpha$ to vary from 0 to $2\pi$, we obtain
a continuous system of such circles, and a second system by reversing
the sign of z.


* For drawings of the torus, showing all four systems of circles, see Hermann Schmidt, Die Inversion
und ihre Anwendungen (Oldenbourg, Munich, 1950), p. 82. (The “elevation” is not as
well drawn as the “plan.”) See also Martin Gardner, Scientific American, 203 (1960), pp. 194,
196. These circles were discovered by Yvon Villarceau, Nouvelles Annales de Mathématiques (1),
7 (1848), pp. 345-347.
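
The oblique circles can also be verified point by point. The Python sketch below parametrizes the great circle cut from the sphere by the plane 8.89 and substitutes its points into the torus equation 8.88. The two unit vectors `e1` and `e2` spanning the plane are a choice made for this illustration (they are orthogonal to each other and to the normal of 8.89), and the function names are likewise invented.

```python
import math

def oblique_circle_point(a, b, alpha, s):
    """Point at parameter s on the circle of radius a, centered at
    (-b sin(alpha), b cos(alpha), 0), lying in the plane 8.89."""
    root = math.sqrt(a * a - b * b)
    center = (-b * math.sin(alpha), b * math.cos(alpha), 0.0)
    # Orthonormal pair spanning the plane 8.89 (assumed construction).
    e1 = (-math.sin(alpha), math.cos(alpha), 0.0)
    e2 = (root * math.cos(alpha) / a, root * math.sin(alpha) / a, b / a)
    return tuple(c + a * (math.cos(s) * u + math.sin(s) * v)
                 for c, u, v in zip(center, e1, e2))

def torus_residual(a, b, point):
    """Residual of (sqrt(x^2 + y^2) - a)^2 + z^2 = b^2 (8.88)."""
    x, y, z = point
    return (math.hypot(x, y) - a) ** 2 + z * z - b * b

for s in (0.0, 1.0, 2.0, 4.0):
    pt = oblique_circle_point(2.0, 1.0, 0.3, s)
    print(pt, torus_residual(2.0, 1.0, pt))
```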

The plane 8.89 meets the torus in two circles, one in each system (with $\alpha$
replaced by $\alpha + \pi$ in the second system). Since these two circles are sections
of the two spheres


$x^2 + y^2 + z^2 - a^2 + b^2 = \pm 2b(y\cos\alpha - x\sin\alpha)$,


their points of intersection are the two “antipodal” points


$\left(\pm\frac{a^2 - b^2}{a}\cos\alpha,\ \pm\frac{a^2 - b^2}{a}\sin\alpha,\ \pm\frac{b}{a}\sqrt{a^2 - b^2}\right)$
(with signs agreeing). Since each of these is a point of contact, 8.89 is a
bitangent plane [R. J. T. Bell 1, p. 267].

Comparing 8.89 with 8.87, we see that the “oblique” circles on the torus
lie in the same planes (through the center) as the pairs of parallel generators
of the hyperboloid of revolution


$\frac{x^2 + y^2}{a^2 - b^2} - \frac{z^2}{b^2} = 1$.

(This remark is due to A. W. Tucker.) 


EXERCISES 
1. The plane through three given points $(x_i, y_i, z_i)$ (i = 1, 2, 3) is


$\begin{vmatrix} x_1 & y_1 & z_1 & 1 \\ x_2 & y_2 & z_2 & 1 \\ x_3 & y_3 & z_3 & 1 \\ x & y & z & 1 \end{vmatrix} = 0$.

If the requirement of passing through a point is replaced (in one or two cases) by the
requirement of being parallel to a line with direction numbers $X_i, Y_i, Z_i$, the corresponding
row of the determinant is replaced by


$X_i \quad Y_i \quad Z_i \quad 0$.


2. In terms of general Cartesian coordinates, the point (x, y, z) will be transformed
into
$(-x, -y, -z)$ by the central inversion $O(-1)$,
$(\mu x, \mu y, \mu z)$ by the dilatation $O(\mu)$ (§7.6),
$(x, y, z + c)$ by a translation along the z-axis.
3. In terms of rectangular coordinates, the point (x, y, z) will be transformed into
$(x, y, -z)$ by reflection in the (x, y)-plane,
$(y, x, z)$ by reflection in the plane x = y,
$(-y, x, z)$ by a quarter-turn about the z-axis,
$(x, -y, z + c)$ by a glide reflection (§7.4).

4. In terms of cylindrical coordinates, the point $(r, \theta, z)$ will be transformed into
$(r, \theta + \alpha, z + c)$ by a twist,
$(\mu r, \theta + \alpha, \mu z)$ by the general dilative rotation (§7.6).
5. Obtain the condition
$2uu' + 2vv' + 2ww' = d + d'$
for two spheres
$x^2 + y^2 + z^2 + 2ux + 2vy + 2wz + d = 0$,
$x^2 + y^2 + z^2 + 2u'x + 2v'y + 2w'z + d' = 0$
to be orthogonal.
6. If (X, Y, Z) is outside the sphere $x^2 + y^2 + z^2 = k^2$, its polar plane contains
the points of contact of all the tangent planes that pass through (X, Y, Z).
7. By expressing 8.86 in the form


$\left(\frac{x}{a}\cos\alpha + \frac{y}{b}\sin\alpha\right)^2 + \left(\frac{y}{b}\cos\alpha - \frac{x}{a}\sin\alpha\right)^2 = 1 + \left(\frac{z}{c}\right)^2$,


find the two systems of generators on the general hyperboloid of one sheet. Two generators
of opposite systems intersect (or occasionally are parallel), but two distinct
generators of the same system are skew. This observation applies also to the two systems
of generators on the hyperbolic paraboloid 8.861.


9 


Complex numbers 


The extension of the Euclidean plane to the inversive plane (§6.4) or to 
the elliptic plane (§6.9) is the geometric counterpart of a familiar procedure 
in algebra: the extension of the concept of number. Beginning with the 
natural numbers such as 1 and 2, we proceed to the integers, then to the rational
numbers, then to the real numbers, then to the complex numbers
(and if we had time we could continue with hypercomplex numbers*). Each 
stage is motivated by our desire to be able to solve a certain kind of equa- 
tion. Real numbers were understood remarkably well by the ancient Greeks. 
Complex numbers were used freely, by R. Bombelli (in his Algebra, Bologna, 
1572) and especially by Euler, many years before they could be treated rigor- 
ously; that was how the word “imaginary” acquired its technical meaning. 
To put “the square root of minus one” on a firm foundation, it is convenient 
(though not essential) to use a geometric representation. Such an interpre- 
tation was suggested by J. Wallis (1685), formulated completely by C. Wessel 
(1797), rediscovered by J. R. Argand (1806), and rediscovered again by 
Gauss.†

The present discussion of number is not intended to be a formal develop- 
ment but rather to emphasize the role of geometry in the working rules. For 
a more complete treatment see Robinson [1, pp. 73-84]. 


9.1 RATIONAL NUMBERS 


“The northern ocean is beautiful,” said the Orc, ‘and beautiful the 
delicate intricacy of the snowflake before it melts and perishes, but 
such beauties are as nothing to him who delights in numbers, spurn- 
ing alike the wild irrationality of life and the baffling complexity of 
nature's laws.” 


J. L. Synge [2, p. 101]


The first numbers that we consider in arithmetic are the natural numbers, 


* See, for instance, Coxeter, Quaternions and reflections, American Mathematical Monthly, 53 
(1946), pp. 136-146. 


† See the excellent article on Complex numbers, by C. C. MacDuffee, in the Encyclopaedia
Britannica (Chicago edition). 
forming a sequence that begins with 1 and never ends. The problem of
solving such an equation as 


x+2=1 


motivates the discovery of the integers, which include not only the natural 
numbers (or “positive integers’’) but also zero and the negative integers. 
The sequence of integers 


..., −2, −1, 0, 1, 2, ...


which has neither beginning nor end, is conveniently represented by points 
evenly spaced along an infinite straight line, which we may think of as the 
x-axis of ordinary analytic geometry. In this representation, addition and 
subtraction appear as translations: the transformation x → x + a shifts each
point through a spaces to the right if a is positive and through −a spaces
to the left if a is negative; in other words, the operation of adding a is the 
translation that transforms 0 into a. 
The problem of solving such an equation as 


2x = 1


motivates the discovery of the rational numbers μ = a/b, where a is an
integer and b is a positive integer; these include not only the integers a = a/1,
but also fractions such as 1/2 (or ½) and −4/3. The rational numbers cannot
be written down successively in their natural order, because between
any two of them there is another, and consequently an infinity of others;
for example, between a/b and c/d we find (a + c)/(b + d). The corresponding
points are dense on the x-axis, and at first sight seem to cover it
completely. Multiplication and division appear as dilatations: the
transformation x → μx is the dilatation O(μ), where O is the origin (which
represents zero); in other words, multiplication by μ is the dilatation with
center O that transforms 1 into μ. Of course, μ may be either positive (Figure
9.1a) or negative (Figure 9.1b). In particular, multiplication by −1 is the
half-turn about O. (The point 1 is joined to an arbitrary point on the
y-axis.)
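
The remark that between any two rational numbers a/b and c/d there lies (a + c)/(b + d) can be tried out directly. The following is only a small sketch using Python's standard `fractions` module; the function name `mediant` and the sample interval from 1/2 to 2/3 are our own choices for illustration.

```python
from fractions import Fraction


def mediant(p: Fraction, q: Fraction) -> Fraction:
    """Return (a + c)/(b + d) for p = a/b and q = c/d written in lowest terms."""
    return Fraction(p.numerator + q.numerator, p.denominator + q.denominator)


# Squeeze ever more rationals in between 1/2 and 2/3.
low, high = Fraction(1, 2), Fraction(2, 3)
between = []
for _ in range(4):
    m = mediant(low, high)
    assert low < m < high      # the new fraction lies strictly between the old ones
    between.append(m)
    high = m                   # repeat on the narrower interval

print(between)
```

Since the loop can be continued indefinitely, no rational number has an immediate successor, which is why the rationals cannot be listed in their natural order.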


Figure 9.1a        Figure 9.1b
We can derive the rational number a/b from the integer a by applying
the dilatation O(1/b), which transforms b into 1. (Figure 9.1c illustrates
the derivation of the rational numbers 3/2 and —1/2.) This construction 
shows clearly why we cannot allow the denominator b to be zero. There 
would be no harm in allowing b to be negative, but we naturally identify 
a/(—b) with —a/b. In the same spirit we usually write each fraction in its 
“lowest terms,” so that the numerator and denominator have no common 
factor.


Figure 9.1c


EXERCISE 
Use the method of Figure 9.1c to construct 4.


9.2 REAL NUMBERS 


"What was that?” inquired Alice. “‘Reeling and Writhing, of course, to 
begin with,” the Mock Turtle replied, ‘and then the different branches 
of Arithmetic—Ambition, Distraction, Uglification, and Derision.”’ 


Lewis Carroll 


[Dodgson 1, Chapter 9] 


The problem of solving such an equation as 
$x^2 = 2$


motivates the discovery of the real numbers, which include not only the rational
numbers but also the irrational numbers (such as √2 and π), which
cannot be expressed as fractions. (Actually, π cannot even be expressed as
a root of an algebraic equation.) Pythagoras’s proof of the irrationality of
√2 was considered by Hardy [2, pp. 32-36] to be one of the most ancient
instances of first-rate mathematics, “as fresh and significant as when it was 
discovered.” In Cantor’s treatment [Robinson 1, p. 79], a real number is 
defined to be the limit of a convergent sequence of rational numbers, or
(more precisely) the set of all sequences “equivalent” (in a specified sense)
to a given sequence; for example, the real number π is the limit of the
sequence


3, 3.1, 3.14, 3.141, 3.1415, ...
or of the “equivalent” sequence
4, 3.2, 3.15, 3.142, 3.1416, ...


The corresponding point on the x-axis is defined to be the limit of a con- 
vergent sequence of “rational” points. 
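
The two “equivalent” sequences approaching π from below and from above can be generated mechanically. This is merely an illustrative sketch: it relies on `math.pi`, which is itself only a double-precision approximation, so it should not be pushed beyond about fifteen decimals; the function names are ours.

```python
import math
from fractions import Fraction


def truncated(n: int) -> Fraction:
    """pi cut off after n decimal places: 3, 3.1, 3.14, ..."""
    return Fraction(math.floor(math.pi * 10 ** n), 10 ** n)


def rounded_up(n: int) -> Fraction:
    """The next n-place decimal above pi: 4, 3.2, 3.15, ..."""
    return Fraction(math.ceil(math.pi * 10 ** n), 10 ** n)


lower = [truncated(n) for n in range(5)]
upper = [rounded_up(n) for n in range(5)]
gaps = [u - l for l, u in zip(lower, upper)]   # 1, 1/10, 1/100, ...

print([float(x) for x in lower])
print([float(x) for x in upper])
```

The differences `gaps` shrink like powers of 1/10, which is one concrete sense in which the two sequences are “equivalent.”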

The real numbers can be further subdivided into algebraic numbers (such 
as √2 and ∛2), which can be expressed as roots of equations


9.21          $a_0 x^n + a_1 x^{n-1} + \cdots + a_n = 0$


with integers for coefficients, and transcendental numbers (such as π and e)
which cannot be so expressed. Among the algebraic numbers, it is sometimes
desirable to distinguish the quadratic numbers, such as √2 and
√(2 − √2), which can be constructed with compasses.
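
To see concretely what it means for a number to be algebraic, one may check numerically that √2 satisfies x² − 2 = 0 and that √(2 − √2) satisfies x⁴ − 4x² + 2 = 0 (square x² − 2 = −√2 to remove the inner radical). The sketch below is only a floating-point check, not a proof, and the helper name `evaluate` is an assumption of ours.

```python
import math


def evaluate(coefficients, x):
    """Horner evaluation of a_0 x^n + a_1 x^(n-1) + ... + a_n."""
    value = 0.0
    for a in coefficients:
        value = value * x + a
    return value


root_two = math.sqrt(2)                    # should satisfy x^2 - 2 = 0
nested = math.sqrt(2 - math.sqrt(2))       # should satisfy x^4 - 4x^2 + 2 = 0

residuals = [
    evaluate([1, 0, -2], root_two),
    evaluate([1, 0, -4, 0, 2], nested),
]
print(residuals)   # both vanish up to rounding error
```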


9.3. THE ARGAND DIAGRAM 


I met a man recently who told me that, so far from believing in the
square root of minus one, he did not even believe in minus one. This is 
at any rate a consistent attitude. 

There are certainly many people who regard √2 as something perfectly
obvious, but jib at √(−1). This is because they think they can
visualize the former as something in physical space, but not the latter.
Actually √(−1) is a much simpler concept.


E. C. Titchmarsh [1, p. 99] 


The problem of solving such an equation as 
9.31          $x^2 + 1 = 0$


motivates the discovery of the complex numbers (so named by Gauss), which 
include not only the real numbers but also such “imaginary” numbers as 
the square root of —1. Since the real numbers occupy the whole x-axis, it 
is natural to try to represent the complex numbers by all the points in the 
(x, y)-plane, that is, to define them as ordered pairs of real numbers with 
suitable rules for their addition and multiplication [Synge 2, Chapter 9]. 
In this so-called Argand diagram (invented by Caspar Wessel in 1797, a few 
years before Argand himself), points are added like the corresponding vec- 
tors from the origin O (which represents 0): 
9.32          (x, y) + (a, b) = (x + a, y + b)


(Figure 9.3a). In other words, to add (a, b) we apply the translation that
takes (0, 0) to (a, b).


Figure 9.3a        Figure 9.3b


Multiplication by an integer still appears as a dilatation; for instance 


2(x, y) = (x, y) + (x, y) = (2x, 2y)


(Figure 9.3b). In particular, multiplication by —1 is the half-turn about O. 
What, then, is multiplication by “the square root of −1”? This must be a
transformation whose “square” is the half-turn about O. The obvious an- 
swer is a quarter-turn about O [Hardy 1, p. 83]. 


Figure 9.3c (left panel: Cartesian coordinates; right panel: polar coordinates)


Then multiplication by an arbitrary complex number should be a trans- 
formation which leaves O invariant and includes both dilatations and rota- 
tions as special cases. The obvious transformation of this kind is a dilative 
rotation (§5.4). Accordingly, the operation of multiplying the general point 
(x, y) by any particular point (a,b) is defined to be the dilative rotation (with 
center O) that transforms (1, 0) into (a, b) [Klein 1, p. 57]. To see how this 
works, let the points (a, b), (x, y) have polar coordinates (c, α), (r, θ), so that
a = c cos α,   b = c sin α,   x = r cos θ,   y = r sin θ.


Then the dilative rotation (Figure 9.3c), multiplying r by c and adding α to
θ, transforms
r cos θ = x   and   r sin θ = y
into
cr cos(θ + α) = cr(cos θ cos α − sin θ sin α)
              = (c cos α)(r cos θ) − (c sin α)(r sin θ)
              = ax − by
and
cr sin(θ + α) = cr(sin θ cos α + cos θ sin α)
              = (c cos α)(r sin θ) + (c sin α)(r cos θ)
              = ay + bx.


Hence, finally, the rule for multiplication is (in Cartesian coordinates) 
9.33          (a, b)(x, y) = (ax − by, ay + bx)
[Hardy 1, p. 80].
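
The agreement between the Cartesian rule 9.33 and the dilative rotation (multiply the radial distances, add the angles) can be confirmed on a numerical instance. In this sketch the function names and the sample points (3, 4) and (1, −2) are our own; `cmath.polar` and `cmath.rect` are the standard library conversions between Cartesian and polar form.

```python
import cmath


def multiply_pairs(c, z):
    """Rule 9.33: (a, b)(x, y) = (ax - by, ay + bx)."""
    (a, b), (x, y) = c, z
    return (a * x - b * y, a * y + b * x)


def multiply_polar(c, z):
    """Dilative rotation about O: multiply the radii, add the angles."""
    radius_c, alpha = cmath.polar(complex(*c))
    radius_z, theta = cmath.polar(complex(*z))
    w = cmath.rect(radius_c * radius_z, theta + alpha)
    return (w.real, w.imag)


c, z = (3.0, 4.0), (1.0, -2.0)
by_rule = multiply_pairs(c, z)          # (11.0, -2.0)
by_rotation = multiply_polar(c, z)      # the same point, up to rounding

print(by_rule, by_rotation)
print(multiply_pairs((0, 1), (0, 1)))   # the quarter-turn applied twice: (-1, 0)
```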
Since (x, y) + (a, 0) = (x + a, y) and (a, 0)(x, y) = (ax, ay), we naturally
identify the complex number (a, 0) with the real number a, so that
a(x, y) = (ax, ay)
and (x, y) = (x, 0) + (0, y) = x + y(0, 1).


Introducing Euler’s special symbol i to denote the complex number (0, 1), 
we have 
(x,y) =x + yi 
=x+iy 


and i² = (0, 1)(0, 1) = (−1, 0) = −1. In this notation, the rules 9.32 and
9.33 become

(x + yi) + (a + bi) = (x + a) + (y + b)i,

(a + bi)(x + yi) = ax − by + (ay + bx)i,
which may simply be thought of as ordinary addition and multiplication,


treating i as an indeterminate, followed by the insertion of −1 for i² [Birkhoff
and MacLane 1, pp. 95-97].


EXERCISES
1. Solve the equation z² − 4z + 5 = 0.
2. The equation u + vi = 0 implies both u = 0 and v = 0.
3. Express (a + bi)⁻¹ in the form x + yi.
4. Adapt Figure 9.3c to the cases (i) b = 0, (ii) a² + b² = 1.




9.4 MODULUS AND AMPLITUDE 


Abraham Demoivre (more correctly written as de Moivre)... died in 
London on November 27, 1754... . Shortly before [his death] he de- 
clared that it was necessary for him to sleep some ten minutes or a 
quarter of an hour longer each day than the preceding one. The day 
after he had thus reached a total of something over twenty-three hours 
he slept up to the limit of twenty-four hours, and then died in his sleep. 


W. W. Rouse Ball 
[Ball 2, pp. 383-384]


The conversion of Cartesian coordinates to polar coordinates shows that 
any complex number z = x + yi is expressible in the form 


r(cos θ + i sin θ).


The radial distance r is called the modulus (or “absolute value”) of z, and is
denoted by |z|. The angle θ is called the amplitude (or “argument”) of z,
and is denoted by am z. Thus


$|x + yi| = \sqrt{x^2 + y^2}, \qquad \operatorname{am}(x + yi) = \arctan\frac{y}{x}$.


Since multiplication by c = a + bi is represented by a dilative rotation 
which is the product of a dilatation in the ratio |c| : 1 and a rotation through 
am c, we have 

|cz| = |c||z|,     am(cz) = am c + am z.
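
Both rules are easily tested on a numerical instance. One caution for computation, which is our remark and not part of the text: the bare arctangent of y/x determines the amplitude only when x is positive, so the sketch uses `math.atan2`, which takes the quadrant into account; moreover, amplitudes add only up to a multiple of 2π. The function name `am` simply mirrors the notation above.

```python
import cmath
import math


def am(z: complex) -> float:
    """Amplitude of z in (-pi, pi]; atan2 picks the correct quadrant."""
    return math.atan2(z.imag, z.real)


c, z = complex(1, 1), complex(-1, 2)
product = c * z

moduli_multiply = math.isclose(abs(product), abs(c) * abs(z))
# Amplitudes add only modulo 2*pi, so compare the corresponding unit vectors.
amplitudes_add = cmath.isclose(cmath.rect(1, am(product)),
                               cmath.rect(1, am(c) + am(z)))
# For z in the second quadrant the bare arctangent is off by pi.
quadrant_gap = am(z) - math.atan(z.imag / z.real)

print(moduli_multiply, amplitudes_add, quadrant_gap)
```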


If |c| = 1, multiplication by c is represented by a pure rotation. Such 
a number c is of the form 
cos α + i sin α,


where α = am c is the angle of the rotation. Repeating this rotation n times,
we obtain de Moivre’s theorem


$(\cos\alpha + i\sin\alpha)^n = \cos n\alpha + i\sin n\alpha$.
In particular, multiplication by


$\omega = \cos\frac{2\pi}{n} + i\sin\frac{2\pi}{n}$

is represented by a rotation through 2π/n. Applying this rotation and its

successive powers to the point 1, we obtain the vertices of a regular n-gon, 

{n}, inscribed in the unit circle |z| = 1. These n points represent the roots 
$1, \omega, \omega^2, \ldots, \omega^{n-1}$


of the cyclotomic equation zⁿ = 1 (see §2.1). From the theory of equations
we deduce the identity


$z^n - 1 = (z - 1)(z - \omega)(z - \omega^2) \cdots (z - \omega^{n-1})$.
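
The factorization of zⁿ − 1 over the vertices of the regular n-gon can be spot-checked in floating-point arithmetic. The sketch below is an illustration only; the choice n = 7, the test point, and the function names are assumptions of ours.

```python
import cmath


def roots_of_unity(n: int):
    """The n points 1, w, w^2, ..., w^(n-1), where w = cos(2*pi/n) + i sin(2*pi/n)."""
    omega = cmath.exp(2j * cmath.pi / n)
    return [omega ** k for k in range(n)]


def factored(z: complex, n: int) -> complex:
    """Product of (z - root) taken over all n roots."""
    result = 1
    for root in roots_of_unity(n):
        result *= z - root
    return result


n, z = 7, complex(0.3, -1.2)
left, right = z ** n - 1, factored(z, n)
print(left, right)   # equal up to rounding error
```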
In particular, the four fourth roots of 1 are
1,   i,   i² = −1,   i³ = −i.


Since i⁴ = 1, the higher powers yield the same cycle over and over again;


for example,
i⁶ = −1,   i⁷ = −i,   i⁸ = 1.


Each complex number z = x + yi has a conjugate z̄ = x − yi, derived
by reflection in the x-axis. Since


z z̄ = x² + y² = |z|²,
the reciprocal of z is
z⁻¹ = |z|⁻² z̄.
In particular, we have
(cos α + i sin α)⁻¹ = cos α − i sin α,


which is de Moivre’s theorem with n = −1.
Inversion in the circle |z| = 1 transforms r(cos θ + i sin θ) into
r⁻¹(cos θ + i sin θ), that is, it transforms z into


z̄⁻¹ = |z|⁻² z.
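
The distinction between the reciprocal (which reverses the amplitude) and the inverse point (which keeps it) is easy to confuse, so a small numerical illustration may help. The sample point 3 − 4i, of modulus 5, is our own choice.

```python
import cmath

z = complex(3, -4)
conjugate = z.conjugate()                 # reflection in the x-axis
reciprocal = conjugate / abs(z) ** 2      # |z|^(-2) times the conjugate
inverse_point = z / abs(z) ** 2           # inversion in the circle |z| = 1

print(reciprocal, 1 / z)                  # the same number twice
print(abs(inverse_point) * abs(z))        # product of the two radial distances
print(cmath.phase(inverse_point), cmath.phase(z))   # equal amplitudes
```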

De Moivre’s theorem is easily extended from integral values of n to rational
values, and then to real values. In particular, a point continuously
describing the circle |z| = 1 represents the complex number
9.41          $\cos\theta + i\sin\theta = (\cos 1 + i\sin 1)^{\theta}$,
where θ varies continuously from 0 to 2π.

Applying the same idea to the complex number

z = μ(cos 1 + i sin 1),


of modulus μ ≠ 1, we see that its powers
$z^{\theta} = \mu^{\theta}(\cos\theta + i\sin\theta)$
are represented by a point describing the equiangular spiral $r = \mu^{\theta}$ (§8.6).


EXERCISE 


Use de Moivre’s Theorem to find 

(a) the two square roots of 3 + 4i;

(b) the three cube roots of 1, say 1, ω, ω²;
(c) the six sixth roots of 1 (in terms of ω);
(d) the twelve twelfth roots of 1. 
9.5 THE FORMULA $e^{\pi i} + 1 = 0$


There is a famous formula—perhaps the most compact and famous of 
all formulas—developed by Euler from a discovery of de Moivre: 
$e^{i\pi} + 1 = 0$. . . . It appeals equally to the mystic, the scientist, the
philosopher, the mathematician. For each it has its own meaning.
Though known for over a century, de Moivre's formula came to Benjamin
Peirce (1809-1880) as something of a revelation. Having discovered
it one day, he turned to his students. . . . “Gentlemen,” he
said, “that is surely true, it is absolutely paradoxical; we cannot understand
it, and we don't know what it means, but we have proved it,
and therefore we know it must be the truth.”


E. Kasner and J. Newman 
[1, pp. 103-104]


Having accepted real powers such as $\mu^{\theta} = e^{\theta \log \mu}$, we naturally ask
whether any meaning can be attached to complex powers. A partial answer 
is supplied by the theory of infinite series such as 


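$e^x = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots$,

$\cos x = 1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \cdots$,

$\sin x = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \cdots$.


The first series, which is familiar when x is real, provides a definition for $e^x$ in
other cases; for instance,

$e^i = 1 + i + \frac{i^2}{2!} + \frac{i^3}{3!} + \frac{i^4}{4!} + \frac{i^5}{5!} + \cdots$

$= 1 + i - \frac{1}{2!} - \frac{i}{3!} + \frac{1}{4!} + \frac{i}{5!} - \cdots$

$= \left(1 - \frac{1}{2!} + \frac{1}{4!} - \cdots\right) + i\left(1 - \frac{1}{3!} + \frac{1}{5!} - \cdots\right)$

$= \cos 1 + i \sin 1$.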
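
The rearrangement of the exponential series into its cosine and sine parts can be watched numerically by summing the series at the imaginary unit. In this sketch the function name `exp_series` and the cutoff of 30 terms are arbitrary choices of ours; a partial sum is of course only an approximation.

```python
import cmath
import math


def exp_series(z, terms: int = 30):
    """Partial sum 1 + z + z^2/2! + ... of the exponential series."""
    total, term = 0, 1
    for k in range(terms):
        total += term              # at this point term equals z**k / k!
        term = term * z / (k + 1)
    return total


e_to_i = exp_series(1j)
target = complex(math.cos(1), math.sin(1))
print(e_to_i, target)                   # agree to rounding error
print(exp_series(1j * math.pi))         # very close to -1
```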


These series enable us to express 9.41 in Euler’s concise form 


9.51          $\cos\theta + i\sin\theta = e^{\theta i}$,


which is a refinement of the identity


$\theta i = \log_e(\cos\theta + i\sin\theta)$
of R. Cotes* (1682-1716), after whose untimely death Newton said, “If Cotes
had lived, we might have known something!”
Setting θ = π in 9.51, we obtain the “famous formula”
$e^{\pi i} = -1$,


which connects in such a surprising way the three important numbers 


e = 2.71828...,     π = 3.14159...,


and i. 
EXERCISES 
1. Evaluate $e^{\pi i/2}$. Is $i^i$ real?
2. From 9.51 deduce $\cos\theta = \frac{e^{\theta i} + e^{-\theta i}}{2}$, $\sin\theta = \frac{e^{\theta i} - e^{-\theta i}}{2i}$, the familiar formulas
for cos(θ + α), sin(θ + α), and the derivatives of cos θ, sin θ.


9.6 ROOTS OF EQUATIONS 


Gauss . . . was the first mathematician to use complex numbers in a 
really confident and scientific way. 

G. H. Hardy 

[Hardy and Wright 1, p. 188] 


In the field of complex numbers we can solve any quadratic equation that 
has real coefficients; for example, the equation 9.31 has the two roots i and 
−i. Still more remarkably, we can solve any quadratic equation with complex
coefficients; indeed, not only any quadratic equation but also any equation
of degree 3 or 4. In saying that we can “solve” an equation we mean
here that we can find explicit expressions for the roots in terms of the
coefficients. It was proved by E. Galois (who was murdered in 1832 when he was
only 20 years old) that the general algebraic equation 9.21 with n > 4 cannot
be solved in this sense [Infeld 1]. Nevertheless, the fundamental theorem
of algebra (which Gauss proved in 1799) asserts the existence of roots for
all values of n, even when explicit expressions are not available. (For a
neat proof, see Birkhoff and MacLane [1, pp. 101-103].) In fact, numerical
solutions can be found, correct to any assigned number of decimal places. 
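
As an illustration of such numerical solution, the following sketch approximates all five roots of the quintic x⁵ − x − 1 = 0, a standard example of an equation that cannot be solved by explicit expressions of the kind just described. The iteration used is the Durand-Kerner (Weierstrass) method, which is our own choice and is not mentioned in the text; the starting points are the conventional powers of 0.4 + 0.9i, and convergence is assumed rather than proved here.

```python
def evaluate(coefficients, x):
    """Horner evaluation of x^n + c_1 x^(n-1) + ... + c_n."""
    value = 0
    for c in coefficients:
        value = value * x + c
    return value


def durand_kerner(coefficients, iterations: int = 200):
    """Approximate all roots of a monic polynomial, coefficients = [1, c_1, ..., c_n]."""
    n = len(coefficients) - 1
    roots = [complex(0.4, 0.9) ** k for k in range(n)]   # conventional starting points
    for _ in range(iterations):
        for i in range(n):
            denominator = 1
            for j in range(n):
                if j != i:
                    denominator *= roots[i] - roots[j]
            # Correct each estimate using the current positions of all the others.
            roots[i] -= evaluate(coefficients, roots[i]) / denominator
    return roots


quintic = [1, 0, 0, 0, -1, -1]          # x^5 - x - 1
roots = durand_kerner(quintic)
residuals = [abs(evaluate(quintic, r)) for r in roots]
print(roots)
print(max(residuals))
```

Exactly one of the five approximations is (very nearly) real; the other four fall into two conjugate pairs, as they must for an equation with real coefficients.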


EXERCISE 


A ladder, 24 feet long, rests against a wall with the extra support of a cubical box
of edge 7 feet, placed at the bottom of the wall with one horizontal edge against the
ladder. How far up the wall does the ladder reach? (Hint: Take 7x to be the height of the
top of the ladder above the top of the box. Obtain an equation whose relevant root is
7x = 9 + 4√2.)


* Harmonia mensurarum, Cambridge, 1722, p. 28. 
9.7 CONFORMAL TRANSFORMATIONS


We saw, in § 9.3, that the transformation
z′ = z + b


(which adds to the complex variable z the complex constant b) is a translation,
whereas

z′ = az

(which multiplies z by the complex constant a) is a dilative rotation about the
point 0, including as special cases a dilatation (when a is real) and a rotation
(when |a| = 1). It follows that a dilative rotation about the general point
c is

z′ − c = a(z − c)
or          z′ = az + (1 − a)c.


Hence the general direct similarity, as described in §5.5, is the general linear 
transformation 


z′ = az + b


[Ford 1, p. 3]; and this is a translation or a dilative rotation according as
a = 1 or a ≠ 1. (In the latter case, c = b/(1 − a).)
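
The center c = b/(1 − a) of a direct similarity can be computed and checked in a few lines. This is a sketch only; the function name and the sample coefficients (a of modulus 2 and amplitude π/3) are our own.

```python
import cmath


def similarity_center(a: complex, b: complex):
    """Invariant point c of z' = a*z + b when a != 1 (None for a translation)."""
    if a == 1:
        return None
    return b / (1 - a)


a, b = cmath.rect(2, cmath.pi / 3), complex(1, -1)
c = similarity_center(a, b)

z = complex(0.5, 2)
image = a * z + b
print(c, a * c + b)                  # c is left invariant
print(image - c, a * (z - c))        # z' - c = a(z - c)
```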

Since the product of an opposite similarity and a reflection is direct, any 
given opposite similarity may be expressed as the product of a given reflec- 
tion and a suitable direct similarity. Using the reflection in the x-axis, 
namely 


z′ = z̄


(§ 9.4), we see that the general opposite similarity is the “conjugate” linear 
transformation 


z′ = az̄ + b.
Since the ratio of magnification is again |a|, this is a glide reflection (possibly
reducing to a pure reflection) if |a| = 1, and a dilative reflection otherwise.


We saw, in § 9.4, that the transformation z′ = z̄⁻¹ is the inversion in the
unit circle |z| = 1. Similarly,


$z' = \frac{k^2}{\bar z}$
is the inversion in the circle |z| = k, of radius k. It follows that the
inversion in the general circle |z − a| = k is z′ − a = k²/(z̄ − ā) or

9.71          $z' = a + \frac{k^2}{\bar z - \bar a}$.
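
The formula for inversion in a general circle, z′ = a + k²/(z̄ − ā), may be tested against the defining properties of inversion: z and z′ lie on the same ray from the center a, the product of their distances from a is k², the transformation is its own inverse, and points of the circle stay fixed. The sample circle (center 1 + 2i, radius 3) and the function name are assumptions made for the illustration.

```python
import cmath


def invert(z: complex, a: complex, k: float) -> complex:
    """Inversion in the circle |z - a| = k: z' = a + k^2 / conj(z - a)."""
    return a + k ** 2 / (z - a).conjugate()


a, k = complex(1, 2), 3.0
z = complex(4, 6)                      # at distance 5 from the center
image = invert(z, a, k)                # 2.08 + 3.44i, at distance 9/5

print(image)
print(abs(image - a) * abs(z - a))     # k^2 = 9
print(invert(image, a, k))             # back to z
print(invert(a + 3j, a, k))            # a point of the circle is fixed
```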
By 6.71, any circle-preserving transformation that is not a similarity is the
product of such an inversion and an isometry


z′ = pz + q     or     z′ = pz̄ + q,


where |p| = 1. To express this product, we replace the z̄ on the right of
9.71 by pz̄ + q or pz + q, obtaining

$z' = a + \frac{k^2}{p\bar z + q - \bar a} = \frac{a\bar z + b}{\bar z + d}$

or          $z' = a + \frac{k^2}{pz + q - \bar a} = \frac{az + b}{z + d}$


respectively, where b and d are certain expressions involving k², p, q, a and ā.
Hence

Every circle-preserving transformation, direct or opposite, is a linear fractional
transformation


9.72          $z' = \frac{az + b}{cz + d}$     or     $z' = \frac{a\bar z + b}{c\bar z + d}$     (ad ≠ bc),


where c may be taken to be 0 or | according as the transformation is, or is not, a 
similarity. 

Conversely, every linear fractional transformation 9.72 transforms circles into
circles. The easiest way to see this is by direct substitution in the equation
|z − u| = k or


(z − u)(z̄ − ū) = k²


for the general circle. This is clearly transformed into an equation of the 
same kind. The following alternative procedure is suggested by a remark 
of N. S. Mendelsohn.* 

If c = 0, the transformation is a similarity, as we have seen. If c ≠ 0,
we could arrange to have c = 1, as before; but we shall find it more convenient
to use a different normalization, namely, to multiply all of a, b, c, d
(if necessary) by such a number as to make the revised coefficients satisfy
ad − bc = 1 [Ford 1, p. 14]. Then we have, in the notation of continued
fractions, the identity


$\frac{az + b}{cz + d} = ac^{-1} + \cfrac{1}{c + \cfrac{1}{-c^{-1} + \cfrac{1}{c + \cfrac{1}{c^{-1}d + z}}}}$


which, of course, continues to hold when we replace z by z̄ on both sides.
Thus the homography


$z' = \frac{az + b}{cz + d}$     (c ≠ 0, ad − bc = 1)


* American Mathematical Monthly, 51 (1944), p. 171.
may be expressed as the product of the nine simpler transformations


z′ = ac⁻¹ + z₈,   z₈ = 1/z₇,   z₇ = c + z₆,   z₆ = 1/z₅,   z₅ = −c⁻¹ + z₄,   z₄ = 1/z₃,

z₃ = c + z₂,   z₂ = 1/z₁,   z₁ = c⁻¹d + z,


which are alternately translations z′ = b + z and Möbius involutions of
the special form z″ = 1/z: the product of the inversion z″ = 1/z̄′ and the
reflection z′ = z̄. (The number of steps could be reduced from nine to four
by using the dilative rotation z′ = cz; but it is interesting to observe that
this more complicated transformation is itself a product of translations and
“horizontal” Möbius involutions.) For the antihomography


$z' = \frac{a\bar z + b}{c\bar z + d}$
we proceed in the same way with one further reflection z′ = z̄. Since all
these are circle-preserving transformations, the desired result follows. 
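
The resolution of a homography into translations alternating with the involution w → 1/w can be verified step by step. In the sketch below the translation constants are, in order of application, c⁻¹d, c, −c⁻¹, c and ac⁻¹; these were checked by direct algebra under the normalization ad − bc = 1. The function names and the sample coefficients are assumptions of ours.

```python
import cmath


def homography(z, a, b, c, d):
    return (a * z + b) / (c * z + d)


def nine_steps(z, a, b, c, d):
    """Five translations alternating with four maps w -> 1/w (needs c != 0, ad - bc = 1)."""
    z1 = d / c + z
    z2 = 1 / z1
    z3 = c + z2
    z4 = 1 / z3
    z5 = -1 / c + z4
    z6 = 1 / z5
    z7 = c + z6
    z8 = 1 / z7
    return a / c + z8


# Arbitrary coefficients, rescaled so that ad - bc = 1.
a, b, c, d = complex(2, 1), complex(1, -1), complex(1, 1), complex(0.5, 2)
scale = cmath.sqrt(a * d - b * c)
a, b, c, d = a / scale, b / scale, c / scale, d / scale

z = complex(0.3, -0.7)
direct, stepwise = homography(z, a, b, c, d), nine_steps(z, a, b, c, d)
print(direct, stepwise)    # equal up to rounding error
```

In exact arithmetic the intermediate values are z₆ = −c(cz + d + 1) and z₇ = −c(cz + d), so that the three middle steps together amount to the dilative rotation mentioned in the parenthetical remark.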


The more powerful methods of the theory of functions of a complex vari- 
able enable us to prove [Ford 1, pp. 3, 15] that every angle-preserving trans- 
formation of the whole inversive plane is of the form 9.72. This shows that 
angle-preserving transformations and circle-preserving transformations are 
synonymous.
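
That a linear fractional transformation carries circles into circles can also be observed numerically: we transform twelve points of a circle, find the circle through three of the images, and check that the remaining nine lie on it. This is an experiment, not a proof; the sample coefficients, the sample circle, and the helper `circumcenter` are our own.

```python
import cmath
import math


def circumcenter(p: complex, q: complex, r: complex) -> complex:
    """Center of the circle through three non-collinear points."""
    w = (r - p) / (q - p)                    # move p to 0 and q to 1
    return p + (q - p) * (w - abs(w) ** 2) / (w - w.conjugate())


a, b, c, d = 2 + 1j, 1 - 1j, 1 + 1j, 0.5 + 2j      # here ad != bc
u, k = 1 + 1j, 0.5                                  # the circle |z - u| = k
points = [u + k * cmath.exp(2j * math.pi * t / 12) for t in range(12)]
images = [(a * z + b) / (c * z + d) for z in points]

center = circumcenter(*images[:3])
radii = [abs(w - center) for w in images]
print(center, min(radii), max(radii))    # all twelve radii agree
```

Note that the center of the image circle is in general not the image of the original center u.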


EXERCISES 
1. When |a| = 1 and a ≠ 1, the transformation z′ = az + b is a rotation. Find
its angle.
2. When |a| ≠ 1, the transformation z′ = az̄ + b is a dilative reflection. What
angle does its axis make with the x-axis?


10 


The five Platonic solids 


We saw, in §4.6, that the Euclidean plane can be filled with squares, four 
at each vertex. If we try to fit squares together with only three at each ver- 
tex, we find that the figure closes as soon as we have used six squares, and 
we have a cube {4, 3}. Similarly, we can fill the plane with equilateral tri- 
angles, six at each vertex, and it is interesting to see what happens if we 
use three, four, or five instead of six. Another possibility is to use penta- 
gons, three at each vertex, in accordance with the symbol {5, 3}. 

With the possible exception of spheres, such polyhedra are the simplest 
solid figures. They provide an easy approach to the subject of topology as 
well as an interesting exercise in trigonometry. They can be defined and 
generalized in various ways [see, e.g., Hilbert and Cohn-Vossen 1, p. 290].


10.1 PYRAMIDS, PRISMS, AND ANTIPRISMS 


Although a Discourse of Solid Bodies be an uncommon and neglected 
Part of Geometry, yet that it is no inconsiderable or unprofitable Improvement
of the Science will (no doubt) be readily granted by such,
whose Genius tends as well to the Practical as Speculative Parts of it,
for whom this is chiefly intended.


Abraham Sharp (1651-1742)
(Geometry Improv'd, London, 1717, p. 65)


A convex polygon (such as {n}, where n is an integer) may be described 
as a finite region of a plane “enclosed” by a finite number of lines, in the 
sense that its interior lies entirely on one side of each line. Analogously, a 
convex polyhedron is a finite region of space enclosed by a finite number of 
planes [Coxeter 1, p. 4]. The part of each plane that is cut off by other planes 
is a polygon that we call a face. Any common side of two faces is an edge.

The most familiar polyhedra are pyramids and prisms. We shall be con- 
cerned solely with “right regular” pyramids whose faces consist of a regu- 
lar n-gon and n isosceles triangles, and with “right regular” prisms whose 
faces consist of two regular n-gons connected by n rectangles (so that there
are two rectangles and one n-gon at each vertex). The height of such a prism
can always be adjusted so that the rectangles become squares, and then we
have an instance of a uniform polyhedron: all the faces are regular polygons
and all the vertices are surrounded alike [Ball 1, p. 135]. When n = 4,
the prism is a cube, which is not merely uniform but regular: the faces are
all alike, the edges are all alike, and the vertices are all alike. (The phrase
“all alike” can be made precise with the aid of the theory of groups. We
mean that there is a symmetry operation that will transform any face, edge, 
or vertex into any other face, edge, or vertex.) 

The height of an n-gonal pyramid can sometimes be adjusted so that the 
isosceles triangles become equilateral. In fact, this can be done when n < 6; 
but six equilateral triangles fall flat into a plane instead of forming a solid 
angle. A triangular pyramid is called a tetrahedron. If three, and therefore
all four, faces are equilateral, the tetrahedron is regular.

By slightly distorting an n-gonal prism we obtain an n-gonal antiprism (or 
“prismatoid,” or “prismoid”), whose faces consist of two regular n-gons 
connected by 2n isosceles triangles. The height of such an antiprism can 
always be adjusted so that the isosceles triangles become equilateral, and 
then we have a uniform polyhedron with three triangles and an n-gon at 
each vertex. When n = 3, the antiprism is the regular octahedron. When
n = 5, we can combine it with two pentagonal pyramids, one on each “base,” 
to form the regular icosahedron [Coxeter 1, p. 5]. A pair of icosahedral dice 
of the Ptolemaic dynasty can be seen in one of the Egyptian rooms of the 
British Museum in London. 

We have now constructed four of the five convex regular polyhedra, 
namely those regarded by Plato as symbolizing the four elements: earth, 
fire, air, and water. The discrepancy between four elements and five solids 
did not upset Plato’s scheme. He described the fifth as a shape that en- 
velops the whole universe. Later it became the quintessence of the medie- 
val alchemists. A model of this regular dodecahedron can be made by fit- 
ting together two “bowls,” each consisting of a pentagon surrounded by 
five other pentagons. The two bowls will actually fit together because their 
free edges form a skew decagon like that formed by the lateral edges of a 
pentagonal antiprism (with isosceles lateral faces). Steinhaus [2, pp. 161-163]
described a very neat method for building up such a model. From a
sheet of cardboard, cut out two nets like Figure 10.1a, one for each bowl.
Run a blunt knife along the five sides of the central pentagon so as to make 
them into hinged edges. Place one net crosswise on the other, with the 
scored edges outward, and bind them by running an elastic band alternately 
above and below the corners of the double star, holding the model flat with 
one hand. Removing the hand so as to allow the central pentagons to move 
away from each other, we see the dodecahedron rising as a perfect model 
(Figure 10.1b).

The most elementary properties of the five Platonic solids are collected 
Figure 10.1a        Figure 10.1b


in Table II on p. 413. Each polyhedron is characterized by a Schläfli symbol
{p, q}, which means that it has p-gonal faces, q at each vertex. The
numbers of vertices, edges, and faces are denoted by V, E, and F. They
can easily be counted in each case, but their significance will become clearer
when we have expressed them as functions of p and q. We shall also obtain
an expression for the dihedral angle, which is the angle between the
planes of two adjacent faces. 


EXERCISES 


1. Give an alternative description of the octahedron (as a dipyramid). 

2. Describe a solid having five vertices and six triangular faces. 

3. Describe the following sections: (i) of a regular tetrahedron by the plane mid- 
way between two opposite edges, (ii) of a cube by the plane midway between two op- 
posite vertices, (iii) of a dodecahedron by the plane midway between two opposite faces. 

4. Six congruent rhombi, with angles 60° and 120°, will fit together to form a 
rhombohedron (“distorted cube”). From the two opposite “acute” corners of this
solid, regular tetrahedra can be cut off in such a way that what remains is an octa- 
hedron. In other words, two tetrahedra and an octahedron can be fitted together to 
form a rhombohedron. Deduce that the tetrahedron and the octahedron have sup- 
plementary dihedral angles, and that infinitely many specimens of these two solids 
can be fitted together to fill the whole Euclidean space [Ball 1, p. 147]. 


10.2 DRAWINGS AND MODELS


You boil it in sawdust: you salt it in glue: 
You condense it with locusts and tape: 
Still keeping one principal object in view— 

To preserve its symmetrical shape. 


Lewis Carroll 
[Dodgson 2a, Fit 5] 


Leonardo da Vinci made skeletal models of polyhedra, using strips of 
wood for their edges and leaving the faces to be imagined [Pacioli 1]. When 
such a model is seen in perspective from a position just outside the center
of one face, this face appears as a large polygon with all the remaining faces
filling its interior. Such a drawing of the solid is called a Schlegel diagram
[Hilbert and Cohn-Vossen 1, pp. 145-146].

Figure 10.2a

Figure 10.2a shows each of the Platonic solids in three aspects: an ordinary
perspective view, a net which can be folded to make a cardboard model,
and a Schlegel diagram. Each can be checked by observing the nature of 
a face and the arrangement of faces at a vertex. 


EXERCISES 


1. Sketch a Schlegel diagram for a pentagonal antiprism. 

2. What is the smallest number of acute-angled triangles into which a given
obtuse-angled triangle can be dissected? (F. W. Levi.†)

3. What is the smallest number of acute-angled triangles into which a square can 
be dissected? (Martin Gardner.*) 


10.3. EULER’S FORMULA 


Euler... overlooked nothing in the mathematics of his age, totally 
blind though he was for the last seventeen years of his life. 


E. T. Bell [2, p. 330]


The Schlegel diagram for a polyhedron shows at a glance which vertices 
belong to which edges and faces. Each face appears as a region bounded by 
edges, except the “initial” face, which encloses all the others. To ensure a 
one-to-one correspondence between faces and regions we merely have to 
associate the initial face with the infinite exterior region. 

Any polyhedron that can be represented by a Schlegel diagram is said to 
be simply connected or “Eulerian,” because its numerical properties satisfy 
Euler’s formula 

V − E + F = 2


[Hilbert and Cohn-Vossen 1, p. 290], which is valid not only for the Schlegel 
diagram of such a polyhedron, but for any connected “map” formed by a 
finite number of points and line segments decomposing a plane into non- 
overlapping regions: the only restriction is that there must be at least one 
vertex! 

A proof resembling Euler’s may be expressed as follows. Any connected 
map can be built up, edge by edge, from the primitive map that consists of 
a single isolated vertex. At each stage, the new edge either joins an old 
vertex to a new vertex, as in Figure 10.3a, or joins two old vertices, as in 
Figure 10.3b. In the former case, V and E are each increased by 1 while


† Mathematics Student, 14 (1946).
* Scientific American, 202 (1960), p. 178.
F is unchanged; in the latter, V is unchanged while E and F are each increased
by 1. In either case, the combination V − E + F is unchanged.
At the beginning, when there is only one vertex and one region (namely, all
the rest of the plane), we have

V − E + F = 1 − 0 + 1 = 2.

This value 2 is maintained throughout the whole construction. Thus, Eu- 
ler’s formula holds for every plane map. In particular, it holds for every 
Schlegel diagram, and so for every simply connected polyhedron. (For an- 
other proof, due to von Staudt, see Rademacher and Toeplitz [1, pp. 75- 
76].) 
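
The bookkeeping in this proof is simple enough to simulate. The sketch below follows V, E and F while a connected map is built up edge by edge, distinguishing only the two kinds of step described above; the labels "new" and "old" and the function name are our own, and the geometry of the map is not modelled at all. For the Schlegel diagram of a cube (8 vertices, 12 edges) there must be 7 steps of the first kind and 5 of the second, in some order.

```python
def build_map(steps):
    """Track (V, E, F) as a connected plane map grows edge by edge.

    "new": the edge joins an old vertex to a new vertex.
    "old": the edge joins two old vertices, cutting one region in two.
    """
    v, e, f = 1, 0, 1          # a single vertex and the one surrounding region
    history = [(v, e, f)]
    for step in steps:
        if step == "new":
            v, e = v + 1, e + 1
        elif step == "old":
            e, f = e + 1, f + 1
        else:
            raise ValueError(step)
        history.append((v, e, f))
    return history


cube = build_map(["new"] * 7 + ["old"] * 5)
print(cube[-1])                                   # (8, 12, 6)
print({v - e + f for v, e, f in cube})            # {2}
```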


Figure 10.3a Figure 10.3b 


In the case of the regular polyhedron { p, q }, the numerical properties 
satisfy the further relations 
10.31 qV = 2E = pF. 
In fact, if we count the q edges at each of the V vertices, we have counted


every edge twice: once from each end. A similar situation arises if we count 
the p sides of each of the F faces, since every edge belongs to two faces. 


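We now have enough information to deduce expressions for V, E, F as
functions of p and q. In fact,

$\frac{V}{1/q} = \frac{E}{1/2} = \frac{F}{1/p} = \frac{V - E + F}{\frac{1}{q} - \frac{1}{2} + \frac{1}{p}} = \frac{2}{\frac{1}{q} - \frac{1}{2} + \frac{1}{p}} = \frac{4pq}{2p + 2q - pq}$,


whence
10.32          $V = \frac{4p}{2p + 2q - pq}, \qquad E = \frac{2pq}{2p + 2q - pq}, \qquad F = \frac{4q}{2p + 2q - pq}$.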


Since these numbers must be positive, the possible values of p and q are 
restricted by the inequality 2p + 2q − pq > 0 or


10.33          (p − 2)(q − 2) < 4.


Thus p − 2 and q − 2 are two positive integers whose product is less than
4, namely,
1·1 or 2·1 or 1·2 or 3·1 or 1·3.


These five possibilities provide a simple proof of Euclid’s assertion [Rade- 
macher and Toeplitz 1, pp. 84-87]: 
There are just five convex regular polyhedra: 


{3, 3}, {4, 3}, {3, 4}, {5, 3}, {3, 5}.


The inequality 10.33 is not merely a necessary condition for the existence 
of {p, q} but also a sufficient condition; for in § 10.1 we saw how to construct
a solid corresponding to each solution.
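
The enumeration can be repeated by brute force: we run through small values of p and q (each at least 3), keep those for which 2p + 2q − pq is positive, and read off V = 4p/(2p + 2q − pq), E = 2pq/(2p + 2q − pq), F = 4q/(2p + 2q − pq). The search bound of 10 in this sketch is an arbitrary assumption; the inequality itself shows that nothing larger can occur.

```python
def platonic_solids(limit: int = 10):
    """Map each admissible symbol (p, q) to its numbers (V, E, F)."""
    solids = {}
    for p in range(3, limit):
        for q in range(3, limit):
            k = 2 * p + 2 * q - p * q
            if k > 0:                       # equivalent to (p - 2)(q - 2) < 4
                solids[(p, q)] = (4 * p // k, 2 * p * q // k, 4 * q // k)
    return solids


for (p, q), (v, e, f) in platonic_solids().items():
    print(f"{{{p}, {q}}}: V = {v}, E = {e}, F = {f}, V - E + F = {v - e + f}")
```

In each of the five cases the divisions come out exactly, and the relations qV = 2E = pF can be read off from the printed values.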

The same inequality arises in a more elementary manner when we construct
a model of the polyhedron from its net. At a vertex we have q p-gons,
each contributing an angle

$\left(1 - \frac{2}{p}\right)\pi$.


In order to form a solid angle, these q face angles must make a total less


than 2π. Thus
$q\left(1 - \frac{2}{p}\right)\pi < 2\pi$,


whence, as before, (p − 2)(q − 2) < 4.


Any maker of models soon observes that the amount by which the sum
of the face angles at a vertex falls short of $2\pi$ is smaller for a complicated
solid like the dodecahedron than for a simple one like the tetrahedron.
Descartes proved that if this amount, say $\delta$, is the same at every vertex, it is
actually equal to $4\pi/V$ [Brückner 1, p. 60]. In the case of {p, q}, this is an
immediate consequence of the formula 10.32 for V, which yields


$\frac{4\pi}{V} = (2p + 2q - pq)\frac{\pi}{p} = 2\pi - q\left(1 - \frac{2}{p}\right)\pi.$
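
As a numerical illustration (a sketch only; the names `deficiency`, `vertices`, and `SOLIDS` are hypothetical and not from the text), the deficiency at a vertex can be computed directly from the face angles and compared with $4\pi/V$ for the five solids:

```python
import math

SOLIDS = [(3, 3), (4, 3), (3, 4), (5, 3), (3, 5)]

def deficiency(p, q):
    """Amount by which the q face angles (1 - 2/p)*pi at a vertex fall short of 2*pi."""
    return 2 * math.pi - q * (1 - 2 / p) * math.pi

def vertices(p, q):
    """Number of vertices V of {p, q}, from formula 10.32."""
    return 4 * p / (2 * p + 2 * q - p * q)

for p, q in SOLIDS:
    delta = math.degrees(deficiency(p, q))
    predicted = math.degrees(4 * math.pi / vertices(p, q))
    print(f"{{{p}, {q}}}: delta = {delta:.1f} degrees, 4*pi/V = {predicted:.1f} degrees")
```

The two columns agree, and the deficiency shrinks from 180 degrees for the tetrahedron to 36 degrees for the dodecahedron.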


EXERCISES
1. The number of edges of {p, q} is given by

$E^{-1} = p^{-1} + q^{-1} - \frac{1}{2}.$

2. Consider an arbitrary polyhedron having p-gonal faces for various values of p,
and q faces at a vertex for various values of q. Generalize the equations 10.31 in the
form

$\sum q = 2E = \sum p,$

where the first summation is taken over all the vertices and the last over all the faces.
Deduce that every polyhedron has either a face with p = 3 or a vertex with q = 3 (or
both). (Hint: If not, we would have $\sum q \ge 4V$ and $\sum p \ge 4F$.)

3. If the faces are all alike and the edges are all alike and the vertices are all alike,
the faces are regular. Show by an example that this result for polyhedra is not valid
for tessellations.


10.4 RADII AND ANGLES


A solid model of {p, q} can evidently be built from F p-gonal pyramids
of suitable altitude, placed together at their common apex, which is the center
$O_3$ of the polyhedron. This point $O_3$ is the common center of three
spheres: the circumsphere which passes through all the vertices, the midsphere
which touches all the edges at their midpoints, and the insphere which
touches all the faces at their centers. The circumradius ${}_0R$ appears as a lateral
edge of any one of the pyramids (Figure 10.4), the midradius ${}_1R$ as
the altitude of a lateral face, and the inradius ${}_2R$ as the altitude of the whole
pyramid.


Figure 10.4a Figure 10.4b 


Such a p-gonal pyramid has p planes of symmetry (or “mirrors”) which
join its apex $O_3$ to the p lines of symmetry of its base. By means of these
p planes, the solid pyramid is dissected into 2p congruent (irregular) tetrahedra
of a very special kind. Let $O_0O_1O_2O_3$ (Figure 10.4c) be such a tetrahedron,
so that $O_0$ is a vertex of the polyhedron, $O_1$ the midpoint of an edge
$O_0O_0'$, $O_2$ the center of a face, and $O_3$ the center of the whole solid. (The
net has been drawn to scale for the case of the cube {4, 3}, in which
$O_0O_1 = O_1O_2 = O_2O_3$.) Since the plane $O_1O_2O_3$ perpendicularly bisects
the edge $O_0O_0'$, $O_0O_1$ is perpendicular to both $O_1O_2$ and $O_1O_3$. Since
$O_0O_1O_2$ is the plane of a face, the inradius $O_3O_2$ is perpendicular to both
$O_0O_2$ and $O_1O_2$. Thus the three lines $O_0O_1$, $O_1O_2$, $O_2O_3$ are mutually
perpendicular and the tetrahedron is “quadrirectangular”: all four faces are
right-angled triangles. Schläfli called such a tetrahedron an orthoscheme
[Coxeter 1, p. 137].


Figure 10.4c

Many relations involving the radii

${}_0R = O_0O_3, \quad {}_1R = O_1O_3, \quad {}_2R = O_2O_3$

can be derived from the four right-angled triangles, in which $O_0O_1 = l$ and
$\angle O_0O_2O_1 = \pi/p$. But the whole story cannot be told till we have found
the angle

$\phi = \angle O_0O_3O_1,$

which is half the angle subtended at the center by an edge [Coxeter 1, pp.
21-22].
Another significant angle is

$\psi = \angle O_1O_3O_2,$

whose complement, $\angle O_2O_1O_3$, is half the dihedral angle of the polyhedron.
In other words, the dihedral angle is $\pi - 2\psi$.


In seeking these angles, it is useful to define the vertex figure of {p, q}: the
polygon formed by the midpoints of the q edges at a vertex $O_0$. This is indeed
a plane polygon, since its vertices lie on the circle of intersection of two
spheres: the midsphere (with center $O_3$ and radius ${}_1R = O_3O_1$), and the
sphere with center $O_0$ and radius $l = O_0O_1$. We see from 2.84 that the vertex
figure of {p, q} is a {q} of side

$2l \cos \frac{\pi}{p}.$

Since its plane is perpendicular to $O_3O_0$, its center Q is the foot of the
perpendicular from $O_1$ to $O_3O_0$ (Figure 10.4), and its circumradius is

$QO_1 = l \cos \phi.$

By 2.81 (with $l \cos \pi/p$ for $l$), this circumradius is

$l \cos \frac{\pi}{p} \csc \frac{\pi}{q}.$

Hence

10.41 $\cos \phi = \cos \frac{\pi}{p} \csc \frac{\pi}{q} = \cos \frac{\pi}{p} \Big/ \sin \frac{\pi}{q}$


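[Coxeter 1, p. 21].
The right-angled triangles in Figure 10.4c now yield

${}_0R = l \csc \phi, \quad {}_1R = l \cot \phi,$

${}_2R^2 = {}_1R^2 - \left(l \cot \frac{\pi}{p}\right)^2, \quad \cos \psi = \frac{{}_2R}{{}_1R}.$

In order to eliminate $\phi$, it is convenient to introduce the temporary abbreviation

$k^2 = \sin^2 \frac{\pi}{p} - \cos^2 \frac{\pi}{q} = \sin^2 \frac{\pi}{q} - \cos^2 \frac{\pi}{p},$

so that $\sin \phi = k \csc \pi/q$. Then

${}_0R = \frac{l}{k} \sin \frac{\pi}{q}, \quad {}_1R = \frac{l}{k} \cos \frac{\pi}{p},$

10.42 ${}_2R = \frac{l}{k} \cot \frac{\pi}{p} \cos \frac{\pi}{q},$

10.43 $\cos \psi = \cos \frac{\pi}{q} \csc \frac{\pi}{p}.$


This last result enables us to compute the dihedral angle


$\pi - 2\psi = 2 \arcsin \left( \cos \frac{\pi}{q} \Big/ \sin \frac{\pi}{p} \right).$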
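
The formulas of this section lend themselves to a quick numerical check. The sketch below is an illustration rather than part of the text: the function name `radii_and_dihedral` and the default half-edge `l = 1.0` are assumptions. It evaluates the three radii (10.42 and the two expressions preceding it) and the dihedral angle for a polyhedron {p, q} of edge 2l.

```python
import math

def radii_and_dihedral(p, q, l=1.0):
    """Circumradius, midradius, inradius and dihedral angle of {p, q} with edge 2*l."""
    k = math.sqrt(math.sin(math.pi / p) ** 2 - math.cos(math.pi / q) ** 2)
    R0 = (l / k) * math.sin(math.pi / q)                             # circumradius
    R1 = (l / k) * math.cos(math.pi / p)                             # midradius
    R2 = (l / k) * math.cos(math.pi / q) / math.tan(math.pi / p)     # inradius, 10.42
    dihedral = 2 * math.asin(math.cos(math.pi / q) / math.sin(math.pi / p))
    return R0, R1, R2, dihedral

for p, q in [(3, 3), (4, 3), (3, 4), (5, 3), (3, 5)]:
    R0, R1, R2, d = radii_and_dihedral(p, q)
    print(f"{{{p}, {q}}}: 0R = {R0:.4f}, 1R = {R1:.4f}, 2R = {R2:.4f}, "
          f"dihedral = {math.degrees(d):.2f} degrees")
```

For the cube of edge 2 this gives radii $\sqrt{3}$, $\sqrt{2}$, 1 and a dihedral angle of 90 degrees, as expected; the dihedral angles of the tetrahedron and octahedron come out supplementary.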


EXERCISES
1. Verify that $k = \sin \frac{\pi}{2c}$, where $c = (2 + p + q)/(10 - p - q)$
[Coxeter 4, p. 753], and that $E = c(c + 1)$.

2. Check the values of the dihedral angle given in Table II on p. 413. (Your calculations
should agree with the observation that the dihedral angles of the tetrahedron and
octahedron are supplementary. See Ex. 4 at the end of § 10.1.)

3. If a polyhedron has a circumsphere and a midsphere and an insphere, and if
these three spheres are concentric, then the polyhedron is regular.


10.5 RECIPROCAL POLYHEDRA


The Platonic solid {p, q} has a reciprocal, which may be defined as the
polyhedron enclosed by a certain set of V planes, namely, the planes of the
vertex figures at the V vertices of {p, q}. Clearly, its edges bisect the edges
of {p, q} at right angles. Among these E edges, those which bisect the p
sides of a face of {p, q} all pass through a vertex of the reciprocal, and those
which bisect the q edges at a vertex of {p, q} form a face of the reciprocal. Thus


The reciprocal of {p, q} is {q, p},


and vice versa. There is a vertex of either for each face of the other; in fact,
the centers of the faces of {p, q} are the vertices of a smaller version of the
reciprocal {q, p} [Steinhaus 1, pp. 72-79].


Figure 10.5a Figure 10.5b 


Figure 10.5a shows how the octahedron {3, 4} arises as the reciprocal of
the cube {4, 3} (or vice versa), and how the reciprocal of a regular tetrahedron
{3, 3} is an equal tetrahedron. The combination of two reciprocal
tetrahedra, ABCD and A’B’C’D’, occurs in nature as a crystal twin. Pacioli
[1, Plates XIX, XX] named it octaedron elevatum. Kepler, rediscovering it a
hundred years later, called it simply stella octangula. The twelve edges of the
two tetrahedra are the diagonals of the six faces of a cube (Figure 10.5b).

By interchanging p and q in the formula 10.32 for V (or F) we obtain the
formula for F (or V). Similarly, since Q, on $O_3O_0$ (Figure 10.4b), is the center
of a face of {q, p}, the angular property $\phi$ of {p, q} is equal to the angular
property $\psi$ of {q, p}, and therefore the expression 10.43 for the property $\psi$ of
{p, q} could have been derived from 10.41 by the simple device of interchanging
p and q.


EXERCISES 


1. A cube of edge 1, with one vertex at the origin and three edges along the Cartesian
axes, has the eight vertices (x, y, z), where each of the three coordinates is either 0 or
1, independently.

2. A cube of edge 2, with its center at the origin and its edges parallel to the Cartesian
axes, has the eight vertices

$(\pm 1, \pm 1, \pm 1).$

3. Where is the center of the dilatation that relates the cubes described in the two
preceding exercises?

4. Obtain coordinates for the vertices of a regular tetrahedron by selecting alternate
vertices of a cube. Find the equations of the face planes and compute the angle
between two of them.

5. Obtain coordinates for the vertices of an octahedron by locating the face centers
of the cube in Ex. 2. Find the equations of the face planes and compute the angle between
two that contain a common edge.

6. The points (x, y, z) that belong to the solid octahedron are given by the inequality

$|x| + |y| + |z| \le 1.$

7. If each edge of a regular tetrahedron is projected into an arc of a great circle on
the circumsphere, at what angles do these arcs intersect? Find the equations of the
planes of the six great circles.


11 


The golden section and phyllotaxis 


Euclid’s construction for the regular pentagon depends on the division of 
a line segment in the ratio $\tau : 1$, where $\tau = (\sqrt{5} + 1)/2$. This algebraic
number is the subject of a book published in 1509, in which Luca Pacioli 
describes its properties or “effects,” stopping at thirteen “for the sake of 
our salvation.” We shall see that, when it is expressed as a continued frac- 
tion [Ball 1, pp. 55-56], all the partial quotients are 1, making it the simplest 
(and slowest to converge) of all infinite continued fractions. The conver- 
gents are found to be quotients of successive members of the sequence of 
Fibonacci numbers 1, 1, 2, 3, 5, 8,..., which any child can begin to write 
though no explicit formula was found till 1843. A subtle manifestation of 
this “divine proportion” in the structure of plants helps to explain the phe- 
nomenon of phyllotaxis, which is most clearly seen in the arrangement of 
cells on the surface of a pineapple. 


11.1 EXTREME AND MEAN RATIO 


I believe that this geometric proportion served the Creator as an idea
when He introduced the continuous generation of similar objects from
similar objects.


J. Kepler (1571-1630)


As we see in Figure 2.8b, the vertex figure of a regular pentagon of side
$2l$ is a line segment of length $\tau l$, where


11.11 $\tau = 2 \cos \pi/5.$


In other words, a pentagon PQRST of side 1 (Figure 11.1a) has diagonals
of length $\tau$. Let the diagonals QS, RT intersect in U. They are said to divide
each other according to the golden section (or “in extreme and mean
ratio”). To see what this means, we observe that the homothetic isosceles
triangles QTU, SRU yield

$\frac{QU}{US} = \frac{QT}{RS} = \frac{QS}{PT} = \frac{QS}{QU}$

(since PT and QU are opposite sides of the rhombus PQUT). Thus the
diagonal QS is divided at U in such a way that the ratio of the larger part to
the smaller is equal to the ratio of the whole to the larger part.


Figure 11.1a Figure 11.1b

If QU = PT = 1, so that $QS = \tau$, then $US = \tau^{-1}$ and

$1 + \tau^{-1} = \tau.$

Hence $\tau$ is the positive root of the quadratic equation
11.12 $\tau^2 - \tau - 1 = 0,$
namely, $\tau = (\sqrt{5} + 1)/2 = 1.6180339887\ldots$.
Figure 11.1b shows an easy way to extend a given segment QU to S so
that $QS = \tau\,QU$. (Here M is the midpoint of the side QU of the square
AQUH, and $MS = MH = \sqrt{5}\,MU$.) Euclid (IV.10) used such a procedure


“to construct an isosceles triangle having each of the angles at the base
double the remaining one” (QST or PRS in Figure 11.1a).
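
The construction can be followed numerically. In the sketch below (an illustration only; the coordinates assigned to Q, U, H, M are an assumed placement of the square AQUH with QU = 1), S is found by laying off MS = MH along QU produced, and the resulting length QS is compared with the root of 11.12 and with 11.11.

```python
import math

tau = (math.sqrt(5) + 1) / 2

# Assumed coordinates: Q = (0, 0), U = (1, 0), H = (1, 1), M = midpoint of QU.
Q, U, H = (0.0, 0.0), (1.0, 0.0), (1.0, 1.0)
M = ((Q[0] + U[0]) / 2, 0.0)

MH = math.dist(M, H)   # equals sqrt(5) * MU, since MU = 1/2 and UH = 1
QS = M[0] + MH         # QM + MS, with MS = MH
US = QS - 1.0          # the smaller part of QS

print(f"QS = {QS:.10f}, tau = {tau:.10f}, US = {US:.10f}")
```

The computed QS agrees with $\tau$, and the smaller part US agrees with $\tau^{-1}$.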


Figure 11.1c


The figure of a pentagon with diagonals can be neatly displayed by tying
a simple knot in a long strip of paper and carefully pressing it flat (Figure
11.1c).
EXERCISES 

1. Defining $\tau$ to be the ratio of the diagonal of a regular pentagon to its side,
establish 11.11 by applying 1.54 to the isosceles triangle PRS.

2. Show how one further setting of the compasses will yield a point (in Figure 11.1b)
dividing the given segment QU in the ratio $\tau : 1$.


11.2 DE DIVINA PROPORTIONE 


Del suo secondo essentiale effecto . . . Del terzo suo singulare effecto
. . . Del quarto suo ineffabile effecto . . . Del .10. suo supremo
effecto . . . Del suo .11. excellentissimo effecto . . . Del suo .12. quasi
incomprehensibile effecto . . .


Luca Pacioli (ca. 1445-1509)
[Pacioli 1, pp. 6-7]


Under the beneficent influence of the artist Piero della Francesca (ca.
1416-1492), Fra Luca Pacioli (or Paccioli) wrote a book about $\tau$, called De
divina proportione, which he illustrated with drawings of models made by
his friend Leonardo da Vinci. His enthusiasm for the subject is apparent
in the above titles that he chose for various chapters. (It is interesting to
observe how closely some of his old Italian resembles English.)

“The seventh inestimable effect” is the occurrence of $\tau$ as the circumradius
of a regular decagon of side 1. (We can thus inscribe a pentagon in a
given circle by first inscribing a decagon and then picking out alternate vertices.)
“The ninth effect, the best of all” is that two crossing diagonals of
a regular pentagon divide one another in extreme and mean ratio. “The
twelfth almost incomprehensible effect” is the following property of the regular
icosahedron {3, 5}.


Figure 11.2a Figure 11.2b


The faces surrounding a vertex of the icosahedron belong to a pyramid
whose base is a regular pentagon (similar to the vertex figure). Any two
opposite edges of the icosahedron belong to a rectangle whose longer sides
are diagonals of such pentagons. Since the diagonal of a pentagon is $\tau$ times
its side, this rectangle is a golden rectangle, whose sides are in the ratio $\tau : 1$.
In fact, the twelve vertices of the icosahedron (Figure 11.2a) are the twelve
vertices of three golden rectangles in mutually perpendicular planes (Figure
11.2b). A model is easily made from three ordinary postcards (which are
nearly golden rectangles). In the middle of each card, cut a slit parallel to
the long sides and equal in length to the short sides. For a practical reason,
the slit in one of the cards must be continued right to the edge. Then
the cards can easily be put together so that each passes through the middle
of another, in cyclic order.


Figure 11.2c 


We see from Figure 11.2c that a golden rectangle can be inscribed in a
square so that each vertex of the rectangle divides a side of the square in
the ratio $\tau : 1$. Identifying this with one of the three “equatorial” squares
of a regular octahedron, we deduce that an icosahedron can be inscribed in
an octahedron so that each vertex of the icosahedron divides an edge of the
octahedron in the ratio $\tau : 1$.
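
The statement that three mutually perpendicular golden rectangles supply the vertices of an icosahedron can be tested numerically. The sketch below is illustrative, not part of the text: it assumes rectangles of sides 2 and $2\tau$ centered at the origin in the three coordinate planes (the placement used in the exercises that follow), and the function name is hypothetical. It counts how many pairs of vertices lie at the shortest distance.

```python
import itertools
import math

tau = (1 + math.sqrt(5)) / 2

def icosahedron_vertices():
    """Vertices of three golden rectangles (sides 2 and 2*tau) in the coordinate planes."""
    verts = []
    for s1, s2 in itertools.product((1, -1), repeat=2):
        verts.append((0.0, s1 * tau, s2 * 1.0))
        verts.append((s2 * 1.0, 0.0, s1 * tau))
        verts.append((s1 * tau, s2 * 1.0, 0.0))
    return verts

verts = icosahedron_vertices()
dists = sorted(math.dist(a, b) for a, b in itertools.combinations(verts, 2))
edges = [d for d in dists if math.isclose(d, 2.0)]
diagonals = [d for d in dists if math.isclose(d, 2 * tau)]

print(len(verts), "vertices,", len(edges), "edges of length 2,",
      len(diagonals), "pentagon diagonals of length 2*tau")
```

The twelve points have 30 pairs at the minimum distance 2, matching E = 30 for {3, 5}, and a further 30 pairs at distance $2\tau$, the diagonals of the pentagons mentioned above.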


EXERCISES 


1. Using Cartesian coordinates referred to the planes of the three golden rectangles,
obtain the 12 vertices of the icosahedron in the form

$(0, \pm\tau, \pm 1), \quad (\pm 1, 0, \pm\tau), \quad (\pm\tau, \pm 1, 0).$

2. These 12 points divide the 12 edges of the octahedron

$(\pm\tau^2, 0, 0), \quad (0, \pm\tau^2, 0), \quad (0, 0, \pm\tau^2)$

in the ratio $\tau : 1$.
3. Obtain coordinates for the 20 vertices of the regular dodecahedron [Coxeter 1,
p. 53].


11.3 THE GOLDEN SPIRAL


Archimedes, Leonardo, Newton—all very practical men, but with something
more. A sense of wonder, perhaps ... Very mysterious, very
mysterious. And very exciting. That is the thing—excitement. You
get it by being God, by constructing, and you get it by watching God,
by observing things as they are.


J. L. Synge [2, p. 163]


Figure 11.3a 


The relation $\tau = 1 + 1/\tau$ shows that the golden rectangle ABDF (Figure
11.3a) can be dissected into two pieces: a square ABCH and a smaller
golden rectangle. (The points B, C, D may be identified with the Q, U, S
of Figure 11.1b.) From the smaller rectangle CDFH we can cut off another
square, leaving a still smaller rectangle, and continue the process indefinitely.

Since the rectangle HJEF is homothetic to ABDF, the vertex J of the
former lies on the diagonal BF of the latter. In fact, the lines AE, BF, CG,
DH together contain all the vertices of all the rectangles. To derive CDFH
from ABDF, we may use a dilative rotation whose center is O, the point of
intersection BF · DH. This similarity, which transforms each of the
points A, C, E, G, I, ... into the next, and each of the points B, D, F, H, J,
... into the next, is the product of a negative quarter-turn about O and the
dilatation $O(\tau^{-1})$. Therefore DH is perpendicular to BF. The inverse similarity,
which transforms each rectangle into the next larger one, is the product
of a positive quarter-turn about O and the dilatation $O(\tau)$.

Since

$\frac{OB}{OD} = \tau = \frac{BC}{CD},$

OC bisects the (right) angle BOD. Thus the lines CG, AE pass through O
and bisect the angles between BF, DH.

In terms of polar coordinates with pole O, the dilative rotation that transforms
OE into OC transforms any point $(r, \theta)$ into $(\tau r, \theta + \frac{1}{2}\pi)$. Taking OE
as the initial line, and the length OE as the unit of measurement, so that E is
(1, 0), we deduce the coordinates $(\tau, \frac{1}{2}\pi)$ for C, $(\tau^2, \pi)$ for A, $(\tau^3, \frac{3}{2}\pi)$ for the
vertex opposite to A in a new square placed on AF, and so on. Similarly,
G is $(\tau^{-1}, -\frac{1}{2}\pi)$, I is $(\tau^{-2}, -\pi)$ or $(\tau^{-2}, \pi)$, and so on. We thus obtain an
infinite sequence of points ..., I, G, E, C, A, ... whose polar coordinates

$r = \tau^n, \quad \theta = \frac{1}{2}\pi n$

satisfy the equation $r = \tau^{2\theta/\pi}$. Hence all these points lie on the equiangular
spiral

$r = \mu^\theta,$

where $\mu = \tau^{2/\pi}$. This true spiral [Cundy and Rollett 1, p. 64] is closely
approximated by the artificial spiral formed by circular quadrants inscribed
in the successive squares, as in Figure 11.3a. (But the true spiral cuts the
sides of the squares at very small angles, instead of touching them.)
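
A short numerical sketch (illustrative only; the function name `spiral_radius` and the dictionary of point labels are assumptions) confirms that the points obtained by repeated application of the dilative rotation lie on the spiral $r = \mu^\theta$:

```python
import math

tau = (1 + math.sqrt(5)) / 2
mu = tau ** (2 / math.pi)

def spiral_radius(theta):
    """Radius of the equiangular spiral r = mu**theta at polar angle theta."""
    return mu ** theta

# Points ..., I, G, E, C, A, ... have polar coordinates (tau**n, n*pi/2).
labels = {-2: "I", -1: "G", 0: "E", 1: "C", 2: "A"}
for n, name in labels.items():
    r, theta = tau ** n, n * math.pi / 2
    print(f"{name}: r = {r:.6f}, spiral radius at theta = {spiral_radius(theta):.6f}")
```

Each quarter-turn multiplies the radius by $\tau$, so one full turn multiplies it by $\tau^4$.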


EXERCISES 


1. Verify the conclusion of §5.4 as applied to the similar rectangles ABDF and 
CDFH in Figure 11.3a. 

2. Obtain polar coordinates for the points J, H, F, D, B. They lie on another
equiangular spiral which is both congruent (by rotation through what angle?) and
homothetic (by what ratio of magnification?) to the spiral IGECA.


11.4 THE FIBONACCI NUMBERS 


Sint minimi 1 et 1 quos imaginaberis inaequales. Adde, fient 2. cui
adde maiorem 1 fient 3. cui adde 2 fient 5. cui adde 3 fient 8. cui
adde 5 fient 13. cui adde 8 fient 21.


J. Kepler [1, p. 270]


In 1202, Leonardo of Pisa, nicknamed Fibonacci (“son of good nature”),
came across his celebrated sequence of integers $f_n$ in connection with the
breeding of rabbits.* He assumed that rabbits live forever, and that every
month each pair begets a new pair which becomes productive at the age of
two months. In the first month the experiment begins with a newborn pair
of rabbits. In the second month there is still just one pair. In the third
month there are 2; in the fourth, 3; in the fifth, 5; and so on. Let $f_n$ denote
the number of pairs of rabbits in the nth month. The first few values
(and their successive ratios) may be tabulated as follows:


n             | 0 | 1 | 2 | 3   | 4      | 5   | 6     | 7      | 8      | 9      | 10     | ...
$f_n$         | 0 | 1 | 1 | 2   | 3      | 5   | 8     | 13     | 21     | 34     | 55     | ...
$f_{n+1}/f_n$ | ∞ | 1 | 2 | 1.5 | 1.6667 | 1.6 | 1.625 | 1.6154 | 1.6190 | 1.6176 | 1.6182 | ...


* R. C. Archibald, Golden Section, American Mathematical Monthly, 25 (1918), pp. 232-238.


Four centuries later, Kepler stated explicitly what Fibonacci must surely
have noticed: that the sum of any two consecutive f’s is equal to the next, so that the
sequence is determined by the recursion formula


11.41 $f_0 = 0, \quad f_1 = 1, \quad f_k + f_{k+1} = f_{k+2}.$


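Kepler observed also that the ratios $f_n : f_{n+1}$ approach $1 : \tau$ more and more
closely as n increases. (See the third line of the above table.) However,
another hundred years passed before R. Simson (1687-1768, who was not
really the discoverer of the “Simson line”) recognized $f_{n+1}/f_n$ as being the
nth convergent of the continued fraction

$1 + \cfrac{1}{1 + \cfrac{1}{1 + \cfrac{1}{1 + \cdots}}}.$

(The “convergents” are the numbers

$1, \quad 1 + \frac{1}{1} = 2, \quad 1 + \cfrac{1}{1 + \cfrac{1}{1}} = \frac{3}{2}, \quad 1 + \cfrac{1}{1 + \cfrac{1}{1 + \cfrac{1}{1}}} = \frac{5}{3},$

and so on.) To prove that $\lim (f_{n+1}/f_n) = \tau$, Simson made repeated use of
the relation $\tau = 1 + 1/\tau$, as follows:

$\tau = 1 + \frac{1}{\tau} = 1 + \cfrac{1}{1 + \cfrac{1}{\tau}} = 1 + \cfrac{1}{1 + \cfrac{1}{1 + \cfrac{1}{\tau}}} = \cdots.$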
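
Simson's observation is easy to reproduce with exact rational arithmetic. The following sketch (the function names `convergents` and `fib` are illustrative assumptions) builds the convergents by iterating $c \mapsto 1 + 1/c$, starting from 1, and compares them with the ratios $f_{n+1}/f_n$.

```python
from fractions import Fraction
import math

def convergents(count):
    """First `count` convergents of the continued fraction 1 + 1/(1 + 1/(1 + ...))."""
    c = Fraction(1)
    out = [c]
    for _ in range(count - 1):
        c = 1 + 1 / c
        out.append(c)
    return out

def fib(n):
    """Fibonacci number f_n for n >= 0, with f_0 = 0 and f_1 = 1 (recursion 11.41)."""
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a

tau = (1 + math.sqrt(5)) / 2
for n, c in enumerate(convergents(10), start=1):
    print(n, c, f"{float(c):.6f}", f"error = {float(c) - tau:+.6f}")
```

The errors alternate in sign and shrink steadily, in keeping with the remark that this is the slowest of all continued fractions to converge.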

Along with $f_n$, E. Lucas (1842-1891) considered the related numbers $g_n$
defined by

$g_0 = 2, \quad g_1 = 1, \quad g_k + g_{k+1} = g_{k+2}.$

The first few values are as follows:


n     | 0 | 1 | 2 | 3 | 4 | 5  | 6  | 7  | 8  | 9  | 10  | ...
$g_n$ | 2 | 1 | 3 | 4 | 7 | 11 | 18 | 29 | 47 | 76 | 123 | ...


It is easily proved by induction that (for n > 0)


11.42 $g_n = f_{n-1} + f_{n+1}.$


Leaving this as an exercise for the reader, we proceed to establish Lucas’s
identities


11.43 $f_{2n} = f_n g_n,$


11.44 $f_{2n+1} = f_n^2 + f_{n+1}^2,$

which are obvious when n = 0 or 1. To prove them both by induction, we
add the tentative relations
$f_{2k-1} = f_{k-1}^2 + f_k^2$ and $f_{2k} = f_k g_k$,
obtaining
$f_{2k+1} = f_{k-1}^2 + f_k(f_{k-1} + f_{k+1}) + f_k^2$
$= (f_{k-1} + f_k) f_{k+1} + f_k^2$
$= f_k^2 + f_{k+1}^2,$
and then add
$f_{2k} = f_k g_k$ and $f_{2k+1} = f_k^2 + f_{k+1}^2$,
obtaining
$f_{2k+2} = f_k^2 + f_k(f_{k-1} + f_{k+1}) + f_{k+1}^2$
$= f_k f_{k+1} + f_{k+1} f_{k+2}$
$= f_{k+1} g_{k+1}.$
Similarly, to establish the identity
11.45 $\tau^n = f_n\tau + f_{n-1},$


which is obvious when n = 1 or 2, we add
$\tau^k = f_k\tau + f_{k-1}$ and $\tau^{k+1} = f_{k+1}\tau + f_k$,
obtaining $\tau^{k+2} = f_{k+2}\tau + f_{k+1}$.


The identity 11.45 continues to hold when n is negative, provided we define
$f_{-k} = f_{-k+2} - f_{-k+1}$ (for k > 0), so that

$f_{-n} = (-1)^{n+1} f_n.$

Thus $\tau^{-k} = f_{-k}\tau + f_{-k-1}$

11.46 $= (-1)^k (f_{k+1} - f_k\tau)$
and

11.47 $(-\tau)^{-k} = f_{k+1} - f_k\tau$
11.471 $= f_{k-1} - f_k\tau^{-1}.$


Adding 11.45 (with k for n) and 11.47, we obtain
11.48 $g_k = \tau^k + (-\tau)^{-k}.$
Similarly, subtracting 11.471 from 11.45 (with k for n), we obtain

11.49 $f_k = \frac{\tau^k - (-\tau)^{-k}}{\tau + \tau^{-1}} = \frac{\tau^k - (-\tau)^{-k}}{\sqrt{5}},$

an explicit formula which was discovered by J. P. M. Binet in 1843.
From 11.48 and 11.49 we immediately deduce

$\tau^k = \frac{g_k + f_k\sqrt{5}}{2}, \quad (-\tau)^{-k} = \frac{g_k - f_k\sqrt{5}}{2};$

for example,
$\tau^3 = \sqrt{5} + 2, \quad \tau^{-3} = \sqrt{5} - 2,$
$\tau^6 = 9 + 4\sqrt{5}, \quad \tau^{-6} = 9 - 4\sqrt{5}.$

EXERCISES 
1. Express the sum of the finite series
$1 + 1 + 2 + 3 + \cdots + f_n$
in terms of $f_{n+2}$.
2. Prove Simson’s identity
$f_{n-1} f_{n+1} - f_n^2 = (-1)^n.$
This is the basis for an amusing puzzle [Ball 1, p. 85].*

3. Verify Lagrange’s observation that the final digits of the Fibonacci numbers recur
after a cycle of sixty:
1, 1, 2, 3, 5, 8, 3, 1, 4, ..., 7, 2, 9, 1, 0.

4. From the identity
$(1 - t - t^2)(1 + t + 2t^2 + 3t^3 + \cdots + f_{n+1}t^n + \cdots) = 1,$
which implies

$\sum_{n=0}^{\infty} f_{n+1} t^n = \frac{1}{1 - t - t^2} = \sum_{k=0}^{\infty} (t + t^2)^k = \sum_{k=0}^{\infty} t^k (1 + t)^k = \sum_{k=0}^{\infty} \sum_{j=0}^{k} \binom{k}{j} t^{k+j},$

deduce Lucas’s formula for Fibonacci numbers in terms of binomial coefficients:
$f_{n+1} = \binom{n}{0} + \binom{n-1}{1} + \binom{n-2}{2} + \cdots.$
5. Setting t = 0.01 in the identity $\sum f_{n+1} t^n = (1 - t - t^2)^{-1}$, deduce an easy
method for writing down the first 19 digits in the decimal for 10000/9899.

6. Tabulate $g_{n+1}/g_n$ and find its limit when n tends to infinity.


* See also Coxeter, The golden section, phyllotaxis, and Wythoff’s game, Scripta Mathematica,
19 (1953), p. 139. More generally,

$f_{n+h} f_{n+k} - f_n f_{n+h+k} = (-1)^n f_h f_k;$

see A. E. Danese, American Mathematical Monthly, 67 (1960), p. 81.


11.5 PHYLLOTAXIS 


In the doctrine of Metamorphosis and the enunciation of the Spiral 
Theory we have .. . two remarkable generalizations which, originat- 
ing in the fertile imagination of Goethe, have passed through the chaos 
of Nature Philosophy and emerged ... to form the groundwork of 
our present views of Plant morphology. 


A. H. Church (1865-1937)
[Church 1, p. 1]


The Fibonacci numbers have a botanical application in the phenomenon
called phyllotaxis (literally “leaf arrangement”). In some trees, such as the
elm and basswood, the leaves along a twig seem to occur alternately on
two opposite sides, and we speak of “1/2 phyllotaxis.” In others, such as the
beech and hazel, the passage from one leaf to the next is given by a screw
displacement involving rotation through one-third of a turn, and we speak
of “1/3 phyllotaxis.” Similarly, the oak and apricot exhibit 2/5 phyllotaxis, the
poplar and pear 3/8, the willow and almond 5/13, and so on. We recognize
the fractions as quotients of alternate Fibonacci numbers, but consecutive
Fibonacci numbers could be used just as well; for example, a positive rotation
through 5/8 of a turn has the same effect as a negative rotation through
3/8 [Weyl 1, p. 72].

Another manifestation of phyllotaxis is the arrangement of the florets of
a sunflower, or of the scales of a fir cone, in spiral or helical whorls (or
“parastichies”). Such whorls are particularly evident in a pineapple (Figure
11.5a), whose more-or-less hexagonal cells are visibly arranged in rows
in various directions: 5 parallel rows sloping gently up to the right, 8 rows
sloping somewhat more steeply up to the left, and 13 rows sloping very
steeply up to the right. (Sometimes the sense is consistently reversed.) If
we regard the surface of the pineapple as a cylinder, cut it along a vertical
line (generator), and spread it out flat on a plane, we obtain a strip between
two parallel lines which represent two versions of the vertical cut. In Figure
11.5b these parallel lines are taken to be the lines x = 0 and x = 1,
and the hexagonal cells, numbered successively in the order of their distances
from the x-axis, appear as the Dirichlet regions of a lattice (§ 4.1). The lattice
point marked 0 appears at the origin, and reappears at (1, 0), as if the
cylinder were rolled along the plane. The lattice point marked 1 has coordinates
$(\tau^{-1}, h)$, where the height h remains to be computed; and for all
values of n the nth lattice point is (x, y) where y = nh and x is the fractional
part of $n\tau$ (or of $n\tau^{-1}$), namely,

$x = n\tau - [n\tau],$

where $[n\tau]$ is the integral part. The repetition of the strip has the effect that
the number n appears at the point

$(n\tau + m, nh),$

for all integers m and n, and the numbers in any straight row are in arithmetic
progression. When the plane is wrapped round the cylinder, such a
straight line becomes a helix, the shape of the rail of a “spiral” staircase.


Figure 11.5a

Since the ratios $f_{k+1}/f_k$ of consecutive Fibonacci numbers converge towards
$\tau$, the number $f_k\tau$ is nearly an integer (namely $f_{k+1}$), that is, its fractional
part is small (or nearly 1). Therefore one of the points marked $f_k$ is close to the
y-axis (especially for large values of k), and the most nearly vertical rows
are those in which the differences of the arithmetic progressions are Fibonacci
numbers.
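
This can be seen numerically. The following sketch (illustrative; the function names `offset` and `record_holders` are assumptions) measures how far $n\tau$ lies from the nearest integer and lists those n that come closer than every smaller n.

```python
import math

tau = (1 + math.sqrt(5)) / 2

def offset(n):
    """Signed distance from n*tau to the nearest integer."""
    x = n * tau
    return x - round(x)

def record_holders(limit):
    """Numbers n <= limit for which n*tau is nearer an integer than for any smaller n."""
    best, records = 1.0, []
    for n in range(1, limit + 1):
        d = abs(offset(n))
        if d < best:
            best, records = d, records + [n]
    return records

print(record_holders(100))
for n in (5, 8, 13, 21):
    print(n, f"{offset(n):+.5f}")
```

The record holders are Fibonacci numbers, and the signs of the offsets alternate, so that successive points $f_k$ fall alternately to the right and left of the y-axis.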

Such an arrangement can be made for any positive value of h. In Figure
11.5b the value 1/144 has been chosen. We soon find by trial that this is
about the right size to ensure that the neighbors of 0 are the Fibonacci numbers
5, 13, 8 and their negatives, as in a pineapple (Figure 11.5a). In other
words, the most obvious rows (with the closest spacing) are those in which
the arithmetic progression has difference $f_6 = 8$. The effect of increasing h
would be to decrease the obtuse angle between the directions 05 and 08, so
that at a certain stage this would become a right angle, and then the Dirichlet
region would be a rectangle instead of a hexagon. Similarly, the effect of
decreasing h would be to increase the angle between the directions of 8 and
13, so that at a certain stage this would become a right angle, and afterwards
the next Fibonacci number 21 would appear as a neighbor of 0.


Figure 11.5b


To determine the critical values of h for which such transitions occur, we
seek the condition for the points $f_k$ and $f_{k+1}$ to lie in perpendicular directions
from 0, that is, the condition for the points

$(f_k\tau - f_{k+1},\; f_k h), \quad (f_{k+1}\tau - f_{k+2},\; f_{k+1} h)$

to lie in perpendicular directions from (0, 0). By 8.22, this condition is

$(f_k\tau - f_{k+1})(f_{k+1}\tau - f_{k+2}) + f_k f_{k+1} h^2 = 0.$

Using 11.12, 11.41, 11.42, then 11.43, 11.44, and finally 11.46, we deduce

$f_k f_{k+1} h^2 = (f_k^2 + f_{k+1}^2)\tau - f_{k+1} g_{k+1} = f_{2k+1}\tau - f_{2k+2} = \tau^{-(2k+1)},$

whence

11.51 $h = (f_k f_{k+1})^{-1/2}\, \tau^{-(k+1/2)}.$

We conclude that the numbers of whorls in various directions (which are
5, 8, 13 in the case of the pineapple) are $f_{k-1}$, $f_k$, $f_{k+1}$ for values of h between

$(f_{k-1} f_k)^{-1/2}\, \tau^{-(k-1/2)}$ and $(f_k f_{k+1})^{-1/2}\, \tau^{-(k+1/2)}.$

As a convenient “standard” value between these critical values, we naturally
choose

$h = f_k^{-1} \tau^{-k}.$

For instance, the value chosen in Figure 11.5b is approximately

$\frac{\tau^{-6}}{f_6} = \frac{0.055728\ldots}{8} = 0.006966\ldots.$
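
The role of the critical values can be explored with a few lines of code. The sketch below is an illustration under stated assumptions: the helper names are hypothetical, only the copy of each numbered point nearest the y-axis is considered, and the search is limited to the first 200 numbers. It lists the numbers nearest to 0 for the standard value of h with k = 6, and checks the perpendicularity condition at a critical value given by 11.51.

```python
import math

tau = (1 + math.sqrt(5)) / 2

def fib(n):
    """Fibonacci number f_n for n >= 0."""
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a

def lattice_point(n, h):
    """Copy of the point numbered n lying nearest the y-axis: (n*tau - m, n*h)."""
    x = n * tau
    return (x - round(x), n * h)

def nearest_numbers(h, count=3, limit=200):
    """The `count` positive numbers whose nearest copies lie closest to the point 0."""
    return sorted(range(1, limit + 1),
                  key=lambda n: math.hypot(*lattice_point(n, h)))[:count]

def critical_h(k):
    """Formula 11.51: the points f_k and f_{k+1} are in perpendicular directions from 0."""
    return (fib(k) * fib(k + 1)) ** -0.5 * tau ** -(k + 0.5)

h = tau ** -6 / fib(6)   # the "standard" value for k = 6
print("h =", round(h, 6), "neighbors of 0:", sorted(nearest_numbers(h)))
print("critical values:", round(critical_h(6), 6), "and", round(critical_h(5), 6))
```

With this h the three nearest numbers are 5, 8, 13, and h lies between the two critical values at which the hexagonal Dirichlet region degenerates into a rectangle.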


The same pattern, interpreted as an arrangement of leaves on a twig,
could reasonably be called 5/13 phyllotaxis or, in the general case, $f_{k-1}/f_{k+1}$
phyllotaxis. For, as Church [1, p. 5] remarks: “Bonnet ... saw quite clearly
in the case of the Apricot that successive 2/5 cycles were really not vertically
superposed.” According to the late A. M. Turing, the gradual advance from
one pair of “parastichy numbers” to another, corresponding to the continuous
decrease of h, may take place during the growth of a single plant. (For
convenience we have taken the girth of the cylinder as our unit of measurement;
thus h will decrease if the cylinder grows faster in girth than in length.)

When the cylinder is replaced by a cone, as in the analogous description
of a fir cone, the lines of the plane no longer represent helices but rather
concho-spirals [Moseley 1]. By allowing the generators of the cone to become
more and more nearly perpendicular to its axis, we approach a limiting
situation in which the concho-spirals become plane equiangular spirals,
cutting one another at the same angles as the helices on the cylinder, that
is, at the same angles as the lines in the plane of the lattice [Church 1, p.
58]. Fibonacci numbers as large as $f_{10} = 55$, $f_{11} = 89$, $f_{12} = 144$ arise as
the numbers of visible spirals in certain varieties of sunflower [see Church
1, Plates V, VII, XIII, and especially Plate VI]. However, it should be
frankly admitted that in some plants the numbers do not belong to the sequence
of f’s but to the sequence of g’s [Church 1, Plate XXV], or even to
the still more anomalous sequences


3, 1, 4, 5, 9, ... or 5, 2, 7, 9, 16, ...


[Church 1, Plate IX]. Thus we must face the fact that phyllotaxis is really
not a universal law but only a fascinatingly prevalent tendency.


EXERCISE 


Draw a version of Figure 11.5b with h = 4. Is this close to the value given by 11.51 


Part III

Ordered geometry


During the last 2000 years, the two most widely read books have undoubt- 
edly been the Bible and the Elements. Scholars find it an interesting task 
to disentangle the various accounts of the Creation that are woven together 
in the Book of Genesis. Similarly, as Euclid collected his material from 
various sources, it is not surprising that we can extract from the Elements
two self-contained geometries that differ in their logical foundation, their 
primitive concepts and axioms. They are known as absolute geometry and 
affine geometry. After describing them briefly in § 12.1, we shall devote 
the rest of this chapter to those propositions which belong to both: propo- 
sitions so fundamental and “obvious” that Euclid never troubled to men- 
tion them. 


12.1 THE EXTRACTION OF TWO DISTINCT GEOMETRIES FROM EUCLID 


The pursuit of an idea is as exciting as the pursuit of a whale. 
Henry Norris Russell (1877 -1957) 


Absolute geometry, first recognized by Bolyai (1802-1860), is the part of
Euclidean geometry that depends on the first four Postulates without the
fifth. Thus it includes the propositions I.1-28, III.1-19, 25, 28-30; IV.4-9
(with a suitably modified definition of “square”). The study of absolute
geometry is motivated by the fact that these propositions hold not only in 
Euclidean geometry but also in hyperbolic geometry, which we shall study 
in Chapter 16. In brief, absolute geometry is geometry without the assump- 
tion of a unique parallel (through a given point) to a given line. 

On the other hand, in affine geometry, first recognized by Euler (1707- 
1783), the unique parallel plays a leading role. Euclid’s third and fourth 
postulates become meaningless, as circles are never mentioned and angles 
are never measured. In fact, the only admissible isometries are half-turns 
and translations. The affine propositions in Euclid are those which are pre- 
served by parallel projection from one plane to another [Yaglom 2, p. 17]: 
for example, I. 30, 33-45, and VI. 1, 2, 4, 9, 10, 24-26. The importance of
affine geometry has lately been enhanced by the observation that these 
propositions hold not only in Euclidean geometry but also in Minkowski’s 
geometry of time and space, which Einstein used in his special theory of 
relativity. 

Since each of Euclid’s propositions is affine or absolute or neither, we 
might at first imagine that the two geometries (which we shall discuss in 
Chapters 13 and 15, respectively) had nothing in common except Postu- 
lates I and II. However, we shall see in the present chapter that there is a 
quite impressive nucleus of propositions belonging properly to both. The 
essential idea in this nucleus is intermediacy (or “betweenness”), which Eu- 
clid used in his famous definition: 


A line (segment) is that which lies evenly between its ends.


This suggests the possibility of regarding intermediacy as a primitive con- 
cept and using it to define a line segment as the set of all points between 
two given points. In the same spirit we can extend the segment to a whole 
infinite line. Then, if B lies between A and C, we can say that the three 
points A, B, C lie in order on their line. This relation of order can be ex- 
tended from three points to four or more. 

Euclid himself made no explicit use of order, except in connection with 
measurement: saying that one magnitude is greater or less than another. It 
was Pasch, in 1882, who first pointed out that a geometry of order could be 
developed without reference to measurement. His system of axioms was 
gradually improved by Peano (1889), Hilbert (1899), and Veblen (1904). 

Etymologically, “geometry without measurement” looks like a contradic- 
tion in terms. But we shall find that the passage from axioms and simple 
theorems to “interesting” theorems resembles Euclid’s work in spirit, though 
not in detail. 

This basic geometry, the common foundation for the affine and absolute 
geometries, is sufficiently important to have a name. The name descriptive
geometry, used by Bertrand Russell [1, p. 382], was not well chosen, be- 
cause it already had a different meaning. Accordingly, we shall follow 
Artin [1, p. 73] and say ordered geometry. 

We shall pursue this rigorous development far enough to give the reader 
its flavor without boring him. The whole story is a long one, adequately 
told by Veblen [1] and Forder [1, Chapter II, and the Canadian Journal of 
Mathematics, 19 (1967), pp. 997-1000].

It is important to remember that, in this kind of work, we must define 
all the concepts used (except the primitive concepts) and prove all the state- 
ments (except the axioms), however “obvious” they may seem. 


EXERCISES 


1. Is the ratio of two lengths along one line a concept belonging to absolute ge- 
ometry or to affine geometry or to both? (Hint: In “one dimension,” i.e., when we 
consider only the points on a single line, the distinction between absolute and affine
disappears.) 

2. Name a Euclidean theorem that belongs neither to absolute geometry nor to 
affine geometry. 

3. The concurrence of the medians of a triangle (1.41) is a theorem belonging to 
both absolute geometry and affine geometry. To which geometry does the rest of § 1.4 
belong? 

4. Which geometry deals (a) with parallelograms? (b) with regular polygons? (c) 
with Fagnano’s problem (§ 1.8)? 


12.2. INTERMEDIACY 


A discussion of order... has become essential to any understanding of 
the foundation of mathematics. 

Bertrand Russell (1872 - ) 

[Russell 1, p. 199]


In Pasch’s development of ordered geometry, as simplified by Veblen, the 
only primitive concepts are points A, B,... and the relation of intermediacy 
[ABC], which says that B is between A and C. If B is not between A and C,
we say simply “not [ABC].” There are ten axioms (12.21-12.27, 12.42, 12.43,
and 12.51), which we shall introduce where they are needed among the vari- 
ous definitions and theorems. 


AXIOM 12.21 There are at least two points. 

AXIOM 12.22 If A and B are two distinct points, there is at least one point 
C for which [ABC]. 

AXIOM 12.23 If [ABC], then A and C are distinct: A ≠ C.

AXIOM 12.24 If [ABC], then [CBA] but not [BCA]. 

THEOREM 12.241 If [ABC] then not [CAB]. 

Proof. By Axiom 12.24, [CAB] would imply not [ABC]. 

THEOREM 12.242 If [ABC], then A ≠ B ≠ C (that is, in view of Axiom
12.23, the three points are all distinct). 


Proof. If B = C, the two conclusions of Axiom 12.24 are contradictory. 
Similarly, we cannot have A = B. 


DEFINITIONS. If A and B are two distinct points, the segment AB is the set 
of points P for which [APB]. We say that such a point P is on the segment. 
Later we shall apply the same preposition to other sets, such as “lines.” 


THEOREM 12.243 Neither A nor B is on the segment AB. 


Proof. If A or B were on the segment, we would have [AAB] or [ABB], 
contradicting 12.242. 
THEOREM 12.244 Segment AB = segment BA.

Proof. By Axiom 12.24, [APB] implies [BPA].


DEFINITIONS. The interval AB is the segment AB plus its end points A
and B:

$\overline{AB} = A + AB + B.$


The ray A/B (“from A, away from B”) is the set of points P for which [PAB].
The line AB is the interval AB plus the two rays A/B and B/A:


$\text{line } AB = A/B + \overline{AB} + B/A.$

COROLLARY 12.2441 Interval AB = interval BA; line AB = line BA.
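
The definitions of segment, interval, ray, and line can be tried out in the most familiar model, the points of a number line, where [ABC] is taken to mean that B lies strictly between A and C in the usual numerical sense. The following Python sketch is only an illustration under that assumption; the function names are ours and are not part of the axiomatic development. It checks Axiom 12.24 and Theorems 12.243 and 12.244 on a small sample of rational points.

```python
from fractions import Fraction

def between(a, b, c):
    """[abc] on a number line: b lies strictly between a and c (assumed model)."""
    return a < b < c or c < b < a

def on_segment(p, a, b):
    """Segment AB: the points P for which [APB]."""
    return between(a, p, b)

def on_interval(p, a, b):
    """Interval AB: the segment together with its end points."""
    return p == a or p == b or between(a, p, b)

def on_ray(p, a, b):
    """Ray A/B (from A, away from B): the points P for which [PAB]."""
    return between(p, a, b)

def on_line(p, a, b):
    """Line AB = A/B + interval AB + B/A."""
    return on_ray(p, a, b) or on_interval(p, a, b) or on_ray(p, b, a)

samples = [Fraction(k, 2) for k in range(-6, 7)]
a, b = Fraction(0), Fraction(1)

# Axiom 12.24: [ABC] implies [CBA] but not [BCA].
axiom_12_24 = all(
    between(z, y, x) and not between(y, z, x)
    for x in samples for y in samples for z in samples
    if between(x, y, z)
)
# Theorem 12.243: neither end point is on the segment.
ends_excluded = not on_segment(a, a, b) and not on_segment(b, a, b)
# Theorem 12.244: segment AB = segment BA.
segment_symmetric = all(on_segment(p, a, b) == on_segment(p, b, a) for p in samples)
# In this one-dimensional model every sample point lies on the line AB.
whole_line = all(on_line(p, a, b) for p in samples)

print(axiom_12_24, ends_excluded, segment_symmetric, whole_line)
```

A finite sample can of course only illustrate the statements; the proofs in the text are what establish them.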


AXIOM 12.25 If C and D are distinct points on the line AB, then A is on the 
line CD. 


THEOREM 12.251 If C and D are distinct points on the line AB, then
line AB = line CD. 


Proof. If A, B, C, D are not all distinct, suppose D = B. To prove that 
line AB = line BC, let X be any point on BC except A or B. By 12.25, A, like
X, is on BC. Therefore B is on AX, and X is on AB. Thus every point on
BC is also on AB. Interchanging the roles of A and C, we see that similarly
every point on AB is also on BC. Thus AB = BC. Finally, if A, B, C, D are
all distinct, we have AB = BC = CD. 


COROLLARY 12.2511 Two distinct points lie on just one line. Two dis- 
tinct lines (if such exist) have at most one common point. (Such a common 
point F is called a point of intersection, and the lines are said to meet in F.)


COROLLARY 12.2512 Any three distinct points A, B, C on a line satisfy 
just one of the relations [ABC], [BCA], [CAB]. 


AXIOM 12.26 If AB is a line, there is a point C not on this line. 


THEOREM 12.261 If C is not on the line AB, then A is not on BC, nor B on
CA: the three lines BC, CA, AB are distinct.

Proof. By 12.25, if A were on BC, C would be on AB.


DEFINITIONS. Points lying on the same line are said to be collinear. 
Three non-collinear points A, B, C determine a triangle ABC, which consists 
of these three points, called vertices, together with the three segments BC, CA, 
AB, called sides. 


AXIOM 12.27 If ABC is a triangle and [BCD] and [CEA], then there is, on
the line DE, a point F for which [AFB]. (See Figure 12.2a.)


THEOREM 12.271 Between two distinct points there is another point. 

Proof. Let A and B be the two points. By 12.26, there is a point E not on 
the line AB. By 12.22, there is a point C for which [AEC]. By 12.251, the 
line AC is the same as AE. By 12.261 (applied to ABE), B is not on this 
line: therefore ABC is a triangle. By 12.22 again, there is a point D for which
[BCD]. By 12.27 there is a point F between A and B. 
THEOREM 12.272 In the notation of Axiom 12.27, [DEF].

Proof. Since F lies on the line DE, there are (by 12.2512) just five possibilities:
F = D, F = E, [EFD], [FDE], [DEF]. Either of the first two would
make A, B, C collinear. 

If [EFD], we could apply 12.27 to the triangle DCE with [CEA] and [EFD]
(Figure 12.2b), obtaining X on AF with [DXC]. Since AF and CD cannot
meet more than once, we have X = B, so that [DBC]. Since [BCD], this
contradicts 12.24. 

Similarly (Figure 12.2c) we cannot have [FDE]. The only remaining pos- 
sibility is [DEF]. 
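
Axiom 12.27 and Theorem 12.272 can be checked in a concrete instance by placing the triangle in the ordinary coordinate plane. The sketch below assumes Cartesian coordinates with exact rational arithmetic, which is extra structure not available in ordered geometry itself; the particular points and the helper names `cross`, `between`, and `meet` are chosen here purely for illustration.

```python
from fractions import Fraction

def cross(o, p, q):
    """Twice the signed area of triangle opq; zero exactly when o, p, q are collinear."""
    return (p[0] - o[0]) * (q[1] - o[1]) - (p[1] - o[1]) * (q[0] - o[0])

def between(a, b, c):
    """[abc]: a, b, c collinear with b strictly between a and c."""
    if cross(a, b, c) != 0:
        return False
    return (a[0] - b[0]) * (c[0] - b[0]) + (a[1] - b[1]) * (c[1] - b[1]) < 0

def meet(p, q, r, s):
    """Common point of the lines pq and rs, assumed not parallel."""
    d1 = (q[0] - p[0], q[1] - p[1])
    d2 = (s[0] - r[0], s[1] - r[1])
    t = Fraction((r[0] - p[0]) * d2[1] - (r[1] - p[1]) * d2[0],
                 d1[0] * d2[1] - d1[1] * d2[0])
    return (p[0] + t * d1[0], p[1] + t * d1[1])

# A sample triangle ABC with D chosen so that [BCD] and E so that [CEA].
A, B, C = (0, 4), (0, 0), (4, 0)
D = (6, 0)
E = (2, 2)
assert between(B, C, D) and between(C, E, A)

# The point F in which the line DE meets the line AB.
F = meet(D, E, A, B)

# Axiom 12.27 asserts [AFB]; Theorem 12.272 asserts [DEF].
print(F, between(A, F, B), between(D, E, F))
```

Moving D further along C/B, or E along the side CA, changes F but not the two order relations, which is what the axiom and the theorem lead us to expect.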


Figure 12.2a Figure 12.2b Figure 12.2c


This proof is typical; so let us be content to give the remaining theorems 
without proofs [Veblen 1, pp. 9-15; Forder 1, pp. 49-55]. 


12.273 A line cannot meet all three sides of a triangle. (Remember that 
the “sides” are not intervals, nor whole lines, but only segments.) 


12.274 If [ABC] and [BCD], then [ABD].


12.275 If [ABC] and [ABD] and C ≠ D, then [BCD] or [BDC], and
[ACD] or [ADC].


12.276 If [ABD] and [ACD] and B ≠ C, then [ABC] or [ACB].

12.277 If [ABC] and [ACD], then [BCD] and [ABD].

DEFINITION. If [ABC] and [ACD], we write [ABCD].


This four-point order is easily seen to have all the properties that we should 
expect; for example, if [ABCD], then [DCBA], but all the other orders are
false. 

Any point O on a segment AB decomposes the segment into two segments: 
AO and OB. (We are using the word decomposes in a technical sense 
[Veblen 1, p. 21], meaning that every point on the segment AB except O itself
is on just one of the two “smaller” segments.) Any point O on a ray from
A decomposes the ray into a segment and a ray: AO and O/A. Any point
O on a line decomposes the line into two “opposite” rays; if [AOB], the rays
are O/A and O/B. The ray O/A, containing B, is sometimes more conven- 
iently called the ray OB. 

For any integer n > 1, n distinct collinear points decompose their line into
two rays and n − 1 segments. The points can be named P1, P2, ..., Pn in
such a way that the two rays are P1/Pn, Pn/P1, and the n − 1 segments are

$P_1P_2,\ P_2P_3,\ \ldots,\ P_{n-1}P_n,$


each containing none of the points. We say that the points are in the order
P1P2...Pn, and write [P1P2...Pn]. Necessary and sufficient conditions for
this are

$[P_1P_2P_3],\ [P_2P_3P_4],\ \ldots,\ [P_{n-2}P_{n-1}P_n].$
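
The claim that the consecutive triples suffice can be tested by brute force in the number-line model, where [ABC] means that B lies strictly between A and C numerically. This is a sketch under that assumption only; `in_order` and `every_triple_in_order` are names introduced here for the illustration.

```python
from itertools import combinations, permutations

def between(a, b, c):
    """[abc] on a number line (assumed model)."""
    return a < b < c or c < b < a

def in_order(points):
    """Test the n - 2 consecutive relations [P1P2P3], [P2P3P4], ..."""
    return all(between(points[i], points[i + 1], points[i + 2])
               for i in range(len(points) - 2))

def every_triple_in_order(points):
    """Test [PiPjPk] for every choice of indices i < j < k."""
    return all(between(p, q, r) for p, q, r in combinations(points, 3))

# Compare the two tests on every arrangement of five distinct points.
agree = all(in_order(p) == every_triple_in_order(p)
            for p in permutations(range(5)))
print(agree)
```

The comparison checks only that the short list of conditions and the full list pick out the same arrangements in this model; it is no substitute for the deduction from 12.274-12.277.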


Naturally, the best logical development of any subject uses the simplest or 
“weakest” possible set of axioms. (The worst occurs when we go to the op- 
posite extreme and assume everything, so that there is no development at 
all!) In his original formulation of Axiom 12.27 [Pasch and Dehn 1, p. 2: 
“IV. Kernsatz”] Pasch made the following far stronger statement: If a line in 
the plane of a given triangle meets one side, it also meets another side (or 
else passes through a vertex). Peano’s formulation, which we have adopted, 
excels this in two respects. The word “plane” (which we shall define in 
§ 12.4) is not used at all, and the line DE penetrates the triangle ABC in a 
special manner, namely, before entering through the side CA, it comes 
from a point D on C/B. It might just as easily have come from a point on 
A/B (which is the same with C and A interchanged) or from a point on B/A 
or B/C (which is quite a different story). The latter possibility (with a slight 
change of notation) is covered by the following theorem (12.278). Axiom 
12.27 is “only just strong enough”; for, although it enables us to deduce the 
statement 12.278 of apparently equal strength, we could not reverse the roles: 
if we tried instead to use 12.278 as an axiom, we would not be able to deduce 
12.27 as a theorem! 


Figure 12.2d 


THEOREM 12.278 If ABC is a triangle and [AFB] and [BCD], then there is,
on the line DF, a point E for which [CEA]. 

Proof. Take H on B/F (as in Figure 12.2d) and consider the triangle DFB 
with [FBH] and [BCD]. By 12.27 and 12.272, there is a point R for which 
[DRF] and [HCR]. By 12.274, [AFB] and [FBH] imply [AFH]. Thus we 
have a triangle DAF with [AFH] and [FRD]. By 12.27 and 12.272 again,
there is a point L for which [DLA] and [HRL]. By 12.277, [HCR] and [HRL]
imply [CRL]. Thus we have a triangle CAL with [ALD] and [LRC]. By
12.27 a third time, there is, on the line DR (= DF), a point E for which [CEA].


EXERCISES 


1. A line contains infinitely many points. 

2. We have defined a segment as a set of points. At what stage in the above development
can we assert that this set is never the null set? [Forder 1, p. 50.]

3. In the proof of 12.272, we had to show that the relation [FDE] leads to a contradiction.
Do this by applying 12.27 to the triangle BFD (instead of EAF).

4. Given a finite set of lines, there are infinitely many points not lying on any of 
the lines. 

5. If ABC is a triangle and [BLC], [CMA], [ANB], then there is a point E for which
[AEL] and [MEN]. [Forder 1, p. 56.]

6. If ABC is a triangle, the three rays B/C, A/C, A/B have a transversal (that is, a 
line meeting them all). (K. B. Leisenring.) 

7. If ABC is a triangle, the three rays B/C, C/A, A/B have no transversal. 


12.3. SYLVESTER’S PROBLEM OF COLLINEAR POINTS 


Almost any field of mathematics offered an enchanting world for dis- 
covery to Sylvester. 


E. T. Bell [1, p. 433]


It may seem to some readers that we have been using self-evident axioms 
to prove trivial results. Any such feeling of irritation is likely to evaporate 
when it is pointed out that the machinery so far developed is sufficiently 
powerful to deal effectively with Sylvester’s conjecture (§ 4.7), which baffled 
the world’s mathematicians for forty years. This matter of collinearity 
clearly belongs to ordered geometry. Kelly’s Euclidean proof involves the 
extraneous concept of distance: it is like using a sledge hammer to crack 
an almond. The really appropriate nutcracker is provided by the follow- 
ing argument. 


THEOREM. If n points are not all collinear, there is at least one line con- 
taining exactly two of them. 


Proof. Let P1, P2, ..., Pn be the n points, so named that the first three
are not collinear (Figure 12.3a). Lines joining P1 to all the other points of
the set meet the line P2P3 in at most n − 1 points (including P2 and P3). Let
Q be any other point on this line. Then the line P1Q contains P1 but no
other Pi.

Lines joining pairs of P’s meet the line P1Q in at most $\binom{n-1}{2} + 1$ points
(including P1 and Q). Let P1A be one of the segments that arise in the decomposition
of this line by all these points. (Possibly A = Q.) Then no
joining line PiPj can meet the “empty” segment P1A. By its definition, A
lies on at least one joining line, say P4P5. If P4 and P5 are the only P’s on
this line (as in Figure 12.3a) our task is finished. If not, we have a joining
line through A containing at least three of the P’s, which we can name P4,
P5, P6 in such an order that the segment AP5 contains P4 but not P6. (Since
A decomposes the line into two opposite rays, one of which contains at least
two of the three P’s, this special naming is always possible. See Figure
12.3b.) We can now prove that the line P1P5 contains only these two P’s.


Figure 12.3a Figure 12.3b


Figure 12.3c Figure 12.3d


We argue by reductio ad absurdum. If the line P1P5 contains (say) P7, we
can use 12.27 and 12.278 to deduce that the segment P1A meets one of the
joining lines, namely, P6P7 or P4P7. In fact, it meets P6P7 if [P1P7P5] (as
in Figure 12.3c), and it meets P4P7 if [P1P5P7] or [P5P1P7] (as in Figure
12.3d). In either case our statement about the “empty” segment is contradicted.

Thus we have found, under all possible circumstances, a line (P4P5 or
P1P5) containing exactly two of the P’s.
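
For a particular finite set of points the conclusion of the theorem can be confirmed by direct search. The following sketch does not follow the argument of the proof; it simply tests every joining line, assuming integer Cartesian coordinates so that collinearity can be decided exactly. The function names and the sample 3 × 3 grid are our own choices.

```python
from itertools import combinations

def collinear(p, q, r):
    """True when p, q, r lie on one line (exact test for integer coordinates)."""
    return (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0]) == 0

def ordinary_lines(points):
    """Pairs (p, q) whose joining line contains no third point of the set."""
    return [(p, q) for p, q in combinations(points, 2)
            if not any(collinear(p, q, r)
                       for r in points if r != p and r != q)]

# Nine points forming a 3 x 3 grid: not all collinear, so the theorem
# promises at least one line containing exactly two of them.
grid = [(x, y) for x in range(3) for y in range(3)]
found = ordinary_lines(grid)
print(len(found), found[0])
```

When all the points are collinear the search returns an empty list, in agreement with the hypothesis of the theorem.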


EXERCISE 
Justify the statement that the joining lines meet the line P1Q in at most $\binom{n-1}{2} + 1$
points. In the example shown in Figure 12.3b, this number (at most 11) is only 5;
why? (The symbol $\binom{n}{j}$ stands for the number of combinations of n things taken j at a
time; for instance, $\binom{n}{2}$ is the number of pairs, namely $\tfrac{1}{2}n(n-1)$.)
12.4 PLANES AND HYPERPLANES


If m hyperplanes in n dimensions are so placed that every n but no
n + 1 have a common point, the number of regions into which they
decompose the space is


$\binom{m}{0} + \binom{m}{1} + \binom{m}{2} + \cdots + \binom{m}{n}.$


Ludwig Schläfli (1814-1895)
[Schläfli 1, p. 209]


It is remarkable that we can do so much plane geometry before defining 
a plane. But now, as the Walrus said, “The time has come....” 

DEFINITIONS. If A, B, C are three non-collinear points, the plane ABC 
is the set of all points collinear with pairs of points on one or two sides of 
the triangle ABC. A segment, interval, ray, or line is said to be in a plane 
if all its points are. 


Axioms 12.21 to 12.27 enable us to prove all the familiar properties of 
incidence in a plane, including the following two which Hilbert [1, p. 4] took 
as axioms: 

Any three non-collinear points in a plane α completely determine that
plane.

If two distinct points of a line a lie in a plane α, then every point of a
lies in α.


DEFINITIONS. An angle consists of a point O and two non-collinear rays
going out from O. The point O is the vertex and the rays are the sides of
the angle [Veblen 1, p. 21; Forder 1, p. 69]. If the sides are the rays OA
and OB, or a1 and b1, the angle is denoted by ∠AOB or a1b1 (or ∠BOA,
or b1a1). The same angle a1b1 is determined by any points A and B on its
respective sides. If C is any point between A and B, the ray OC is said to
be within the angle.

From here till the statement of Axiom 12.41, we shall assume that all the 
points and lines considered are in one plane. 

A convex region is a set of points, any two of which can be joined by a 
segment consisting entirely of points in the set, with the extra condition that 
each of the points is on at least two non-collinear segments consisting en- 
tirely of points in the set. In particular, an angular region is the set of 
all points on rays within an angle, and a triangular region is the set of all 
points between pairs of points on distinct sides of a triangle. An angular 
(or triangular) region is said to be bounded by the angle (or triangle). 

It can be proved [Veblen 1, p. 21] that any line containing a point of a 
convex region “decomposes” it into two convex regions. In particular, a 
line a decomposes a plane (in which it lies) into two half planes. Two points 
are said to be on the same side of a if they are in the same half plane, on
opposite sides if they are in opposite half planes, that is, if the segment joining
them meets a. In the latter case we also say that a separates the two
points. (It is unfortunate that the word “side” is used with two different 
meanings, both well established in the literature. However, the context will 
always show whether we are considering the two sides of an angle, which 
are rays, or the two sides of a line, which are half planes.) 
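
In the coordinate plane the two sides of a line can be told apart by the sign of a signed area, and two points are on opposite sides exactly when the segment joining them meets the line. The sketch below illustrates this; Cartesian coordinates and signed areas are assumptions of the illustration (they belong to the familiar model, not to ordered geometry), and the helper names are hypothetical.

```python
from fractions import Fraction

def signed_area(a, b, p):
    """Twice the signed area of triangle abp; its sign tells on which side of line ab p lies."""
    return (b[0] - a[0]) * (p[1] - a[1]) - (b[1] - a[1]) * (p[0] - a[0])

def same_side(a, b, p, q):
    """True when p and q are in the same half plane determined by the line ab."""
    return signed_area(a, b, p) * signed_area(a, b, q) > 0

def opposite_sides(a, b, p, q):
    """True when the line ab separates p and q."""
    return signed_area(a, b, p) * signed_area(a, b, q) < 0

def crossing_point(a, b, p, q):
    """Point in which the segment pq meets the line ab, or None if it does not."""
    sp, sq = signed_area(a, b, p), signed_area(a, b, q)
    if sp * sq >= 0:
        return None
    t = Fraction(sp, sp - sq)  # 0 < t < 1 because sp and sq have opposite signs
    return (p[0] + t * (q[0] - p[0]), p[1] + t * (q[1] - p[1]))

a, b = (0, 0), (1, 0)              # the line ab is the x-axis
p, q, r = (0, 2), (3, -1), (5, 1)
x = crossing_point(a, b, p, q)
print(same_side(a, b, p, r), opposite_sides(a, b, p, q), x)
```

Points lying on the line itself belong to neither half plane, so both tests return False for them.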


Figure 12.4a 


As we remarked in § 12.2, any point O on a line a decomposes a into two
rays, say a1 and a2. Any other line b through O is likewise decomposed by
O into two rays b1 and b2, one in each of the half planes determined by a.
Each of these rays decomposes the half plane containing it into two angular
regions. Thus any two intersecting lines a and b together decompose
their plane into four angular regions, bounded by the angles


a1b1, b1a2, a2b2, b2a1,


as in Figure 12.4a. The opposite rays a1 and a2 are said to separate the
rays b1 and b2; they likewise separate all the rays within either of the angles
a1b1, b1a2 from all the rays within either of the angles a2b2, b2a1. We also
say that the rays a1 and b1 separate all the rays between them from a2, b2,
and from all the rays within b1a2, a2b2, or b2a1.

It follows from the definition of a line that two distinct points, A and B, 
decompose their line into three parts: the segment AB and the two rays 
A/B, B/A. Somewhat similarly, two nonintersecting (but coplanar) lines, 
a and b, decompose their plane into three regions. One of these regions lies
between the other two, in the sense that it contains the segment AB for any
A on a and B on b. Another line c is said to lie between a and b if it meets
such a segment AB but does not meet a or b, and we naturally write [acb].


12.401 If ABC and A’B’C’ are two triads of collinear points, such that
the three lines AA’, BB’, CC’ have no intersection, and if [ACB], then [A’C’B’].

Analogous consideration of an angular region yields

12.402 If ABC and A’B’C’ are two triads of collinear points on distinct
lines, such that the three lines AA’, BB’, CC’ have a common point O which is not
between A and A’, nor between B and B’, nor between C and C’, and if [ACB],
then [A’C’B’].


We need one or more further axioms to determine the number of dimen- 
sions. If we are content to work in two dimensions we say 


AXIOM 12.41 All points are in one plane.

If not [Forder 1, p. 60], we say instead:

AXIOM 12.42 If ABC is a plane, there is a point D not in this plane. 


We then define the tetrahedron ABCD, consisting of the four non-coplanar 
points A, B, C, D, called vertices, the six joining segments AD, BD, CD, 
BC, CA, AB, called edges, and the four triangular regions BCD, CDA, 
DAB, ABC, called faces. The space (or “3-space”) ABCD is the set of all
points collinear with pairs of points in one or two faces of the tetrahedron 
ABCD. 

We can now deduce the familiar properties of incidence of lines and planes 
[Forder 1, pp. 61-65]. In particular, any four non-coplanar points of a space
determine it, and the line joining any two points of a space lies entirely in the
space. If Q is in the space ABCD and P is in a face of the tetrahedron
ABCD, then PQ meets the tetrahedron again in a point distinct from P. 

If we are content to work in three dimensions, we say 


AXIOM 12.43 All points are in the same space. 


Figure 12.4b 


Consequently: 


THEOREM 12.431 Two planes which meet in a point meet in another point, 
and so in a line. 

Proof. Let P be the common point and α one of the planes. Take A,
B, C in α so that P is inside the triangle ABC. Let DPQ be a triangle in
the other plane β (Figure 12.4b). If D or Q lies in α, then α and β have
two common points. If not, PQ meets the tetrahedron ABCD in a point R
distinct from P; and DR, in β, meets the triangle ABC in a point common
to α and β.


If, on the other hand, we wish to increase the number of dimensions, we 
replace 12.43 by 


AXIOM 12.44 If A0A1A2A3 is a 3-space, there is a point A4 not in this
3-space.

We then define the simplex A0A1A2A3A4, which has 5 vertices Ai, 10 edges
AiAj (i < j), 10 faces AiAjAk (i < j < k), and 5 cells AiAjAkAl (which are
tetrahedral regions). The 4-space A0A1A2A3A4 is the set of points collinear
with pairs of points on one or two cells of the simplex.

The possible extension to n dimensions (using mathematical induction)
is now clear. The n-space $A_0A_1 \ldots A_n$ is decomposed into two convex regions
(half-spaces) by an (n − 1)-dimensional subspace such as $A_0A_1 \ldots A_{n-1}$,
which is called a hyperplane (or “prime,” or “(n − 1)-flat”).


EXERCISES 


1. Any 5 coplanar points, no 3 collinear, include 4 that form a convex quadrangle. 

2. A ray OC within ∠AOB decomposes the angular region into two angular regions,
bounded by the angles AOC and COB. [Veblen 1, p. 24.] 

3. If m distinct coplanar lines meet in a point O, they decompose their plane into 
2m angular regions [Veblen 1, p. 26]. 

4. If ABC is a triangle, the three lines BC, CA, AB decompose their plane into seven
convex regions, just one of which is triangular. 

5. If m coplanar lines are so placed that every 2 but no 3 have a common point,
they decompose their plane into a certain number of convex regions. Call this number
f(2, m). Then

$f(2, m) = f(2, m-1) + m.$

But f(2, 0) = 1. Therefore f(2, 1) = 2, f(2, 2) = 4, f(2, 3) = 7, and


$f(2, m) = 1 + m + \binom{m}{2}.$


6. If m planes in a 3-space are so placed that every 3 but no 4 have a common point,
they decompose their space into (say) f(3, m) convex regions. Then

$f(3, m) = f(3, m-1) + f(2, m-1).$

But f(3, 0) = 1. Therefore f(3, 1) = 2, f(3, 2) = 4, f(3, 3) = 8, f(3, 4) = 15, and


$f(3, m) = 1 + \binom{m}{1} + \binom{m}{2} + \binom{m}{3}.$ [Steiner 1, p. 87.]


7. Obtain the analogous result for m hyperplanes in an n-space. 
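
The numerical values quoted in Exercises 5 and 6 can be reproduced mechanically. The sketch below assumes that the same recurrence, f(n, m) = f(n, m − 1) + f(n − 1, m − 1), continues to hold in n dimensions, with the conventions f(n, 0) = 1 and f(0, m) = 1 (an undivided space, and a single point, each count as one region); these conventions and the function names are our assumptions. It then compares the result with a sum of binomial coefficients of the kind appearing in Schläfli's formula.

```python
from functools import lru_cache
from math import comb

@lru_cache(maxsize=None)
def regions(n, m):
    """Regions cut from n-space by m hyperplanes in general position (assumed recurrence)."""
    if n == 0 or m == 0:
        return 1
    # The m-th hyperplane is itself cut by the other m - 1 into regions(n - 1, m - 1)
    # pieces, and each of those pieces splits one existing region into two.
    return regions(n, m - 1) + regions(n - 1, m - 1)

def closed_form(n, m):
    """Sum of the binomial coefficients C(m, 0) + C(m, 1) + ... + C(m, n)."""
    return sum(comb(m, k) for k in range(n + 1))

print([regions(2, m) for m in range(4)])
print([regions(3, m) for m in range(5)])
print(all(regions(n, m) == closed_form(n, m)
          for n in range(6) for m in range(9)))
```

For n = 1 the recurrence gives m + 1 parts, which agrees with the remark in § 12.2 that n collinear points decompose their line into two rays and n − 1 segments.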


12.5 CONTINUITY 


Nothing but Geometry can furnish a thread for the labyrinth of the 

composition of the continuum .. . and no one will arrive at a truly 
solid metaphysic who has not passed through that labyrinth. 

G. W. Leibniz (1646-1716) 

[Russell 2, pp. 108-109]


Between any two rational numbers (§ 9.1) there is another rational num- 
ber, and therefore an infinity of rational numbers; but this does not mean 
that every real number (§ 9.2) is rational. Similarly, between any two points
(12.271) there is another point, and therefore an infinity of points; but this
does not mean that the axioms in § 12.2 make the line “continuous.” In fact, 
continuity requires at least one further axiom. There are two well-recog- 
nized approaches to this subtle subject. One, due to Cantor and Weier- 
strass, defines a monotonic sequence of points, with an axiom stating that 
every bounded monotonic sequence has a limit [Coxeter 2, Axiom 10.11]. The 
other, due to Dedekind, obtains a general point on a line as the common 
origin of two opposite rays [Coxeter 3, p. 162]. Its arithmetical counterpart
is illustrated by describing √2 as the “section” between rational numbers
whose squares are less than 2 and rational numbers whose squares are
greater than 2. Dedekind’s Axiom, though formidable in appearance, is 
the more readily applicable; so we shall use it here: 


AXIOM 12.51 For every partition of all the points on a line into two non- 
empty sets, such that no point of either lies between two points of the other, 
there is a point of one set which lies between every other point of that set and 
every point of the other set. 


This axiom is easily seen to imply several modified versions of the same 
statement. Instead of “the points on a line” we could say “the points on a 
ray” or “the points on a segment” or “the points on an interval.” (In the 
last case, for instance, the rest of the line consists of two rays which can be 
added to the two sets in an obvious manner.) Another version [Forder 1, 
p. 299] is: 

THEOREM 12.52 For every partition of all the rays within an angle into 
two nonempty sets, such that no ray of either lies between two rays of the other, 
there is a ray of one set which lies between every other ray of that set and every 
ray of the other set. 


To prove this for an angle ∠AOB, we consider the section of all the rays
by the line AB, and apply the “segment” version of 12.51 to the segment AB.
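
The arithmetical section that describes √2 shows why something like Axiom 12.51 is needed: among the rational numbers alone, neither of the two sets contains a number lying between all its companions and the other set. The following sketch illustrates this with exact rational arithmetic. The map q → (2q + 2)/(q + 2) is a standard device, brought in here as an assumption of the illustration rather than taken from the text: it sends a positive rational on either side of the section to another rational on the same side but nearer to the section.

```python
from fractions import Fraction

def in_lower(q):
    """Lower set of the section: negative rationals and those with square less than 2."""
    return q < 0 or q * q < 2

def nudge(q):
    """Move a positive rational toward the section without crossing it."""
    return (2 * q + 2) / (q + 2)

def chain(start, steps):
    """The successive images of start under nudge."""
    out = [start]
    for _ in range(steps):
        out.append(nudge(out[-1]))
    return out

# An increasing chain inside the lower set: it has no greatest element.
lower_chain = chain(Fraction(1), 6)
# A decreasing chain inside the upper set: it has no least element.
upper_chain = chain(Fraction(2), 6)

print([str(q) for q in lower_chain])
print([str(q) for q in upper_chain])
```

Since every member of either set can be improved upon in this way, no rational number can play the part of the special point in 12.51; on a continuous line the axiom supplies it.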


12.6 PARALLELISM 


In the last few weeks I have begun to put down a few of my own Meditations,
which are already to some extent nearly 40 years old. These I
had never put in writing, so I have been compelled three or four times
to go over the whole matter afresh in my head.


C. F. Gauss (1777-1855)


(Letter to H. K. Schumacher, May 17, 1831, as translated by Bonola
[1, p. 67])


The idea of defining, through a given point, two rays parallel to a given 
line (in opposite senses), was developed independently by Gauss, Bolyai, 
and Lobachevsky. The following treatment is closest to that of Gauss. 
THEOREM 12.61 For any point A and any line r, not through A, there are
just two rays from A, in the plane Ar, which do not meet r and which separate 
all the rays from A that meet r from all the other rays that do not. 


Figure 12.6a 


Proof. Taking any two distinct points B and C on r, we apply 12.52 to 
the angle between the rays AC and A/B (marked in Figure 12.6a). We consider
the partition of all the rays within this angle into two sets according
as they do or do not meet the ray C/B. Clearly, these sets are not empty, 
and no ray in either set lies between two in the other. We conclude that 
one of the sets contains a special ray p1 which lies between every other ray
of that set and every ray of the other set. 

In fact, p1 belongs to the second set. For, if it met C/B, say in D, we
would have [BCD]. By Axiom 12.22, we could take a point E such that
[CDE], with the absurd conclusion that AE belongs to both sets: to the first, 
because E is on C/B, and to the second, because AD lies between AC and 
AE. 

We have thus found a ray p1, within the chosen angle, which is the “first”
ray that fails to meet the ray C/B; this means that every ray within the
angle between AC and p1 does meet C/B. Interchanging the roles of B
and C, we obtain another special ray q1, on the other side of AB, which
may be described (for a counterclockwise rotation) as the “last” ray that
fails to meet B/C. Since the line r consists of the two rays B/C, C/B, along
with the interval BC, we have now found two rays p1, q1, which separate all
the rays from A that meet r from all the other rays (from A) that do not.
[Forder 1, p. 300.]

These special rays from A are said to be parallel to the line r in the two 
senses: p1 parallel to C/B, and q1 parallel to B/C. (Two rays are said to
have the same sense if they lie on the same side of the line joining their in- 
itial points.) 

For the sake of completeness, we define the rays parallel to r from a point 
A on r itself to be the two rays into which A decomposes r. The distinction 
between affine geometry and hyperbolic geometry depends on the question 
whether, for other positions of A, the two rays p1, q1 are still the two halves
of one line. If they are, this line decomposes the plane into two half planes, 
one of which contains the whole of the line r. If not, the lines p and q (which 
contain the rays) decompose the plane into four angular regions

p1q1, q1p2, p2q2, q2p1.


In this case, by 12.61, r lies entirely in the region p1q1.


COROLLARY 12.62 For any point A and any line r, not through A, there is 
at least one line through A, in the plane Ar, which does not meet r. 


Figure 12.6b Figure 12.6c


Another familiar property of parallelism is its “transmissibility”:


THEOREM 12.63 The parallelism of a ray and a line is maintained when 
the beginning of the ray is changed by the subtraction or addition of a segment. 

Proof [Gauss 1, vol. 8, p. 203]. Let p1 be a ray from A which is parallel
to a line r through B, and let A’ be any point on this ray (Figure 12.6b) or on
the opposite ray p2 (Figure 12.6c). The modified ray p’1, beginning at A’,
is A’/A or A’A, respectively; it obviously does not meet r. What remains
to be proved is that every ray from A’, within the angle between A’B and
p’1, does meet r. Let D be any point on such a ray (Figure 12.6b) or on its
opposite (Figure 12.6c). Since p1 (from A) is parallel to r, the line AD (containing
a ray within the angle between AB and p1) meets r, say in C. The
line A’B, separating A from D, meets the segment AD, say in E. By Axiom
12.27, applied to the triangle CBE with [BEA’] and [EDC] (Figure 12.6b) or
to the triangle BCE with [CED] and [EA’B] (Figure 12.6c), the line A’D
meets BC. Thus p’1 is parallel to r.


This property of transmissibility enables us to say that the line p = AA’ 
is parallel to the line r = BC, provided we remember that this property is 
associated with a definite “sense” along each line. 


Busemann [1, p. 139 (23.5)] has proved that it is not possible, within the 
framework of two-dimensional ordered geometry, to establish the “symmetry”
of parallelism: that if p is parallel to r then r is parallel to p. To supply
this important step we need either Axiom 12.42 [as in Coxeter 3, pp.
165-177] or the affine axiom of parallelism (13.11) or the absolute axioms 
of congruence (§ 15.1). 
THEOREM 12.64 If two lines are both parallel to a third in the same sense,
there is a line meeting all three. 


Proof. We have to show that, if lines p and s are both parallel to r in the
same sense, then the three lines p, r, s have a transversal. In affine geometry
this is obvious, so let us assume the geometry to be hyperbolic. Of the two
lines parallel to r through a point A on p, one is p itself. Let q be the other,
and let r be in the angular region p1q1, so that the rays p1 and q1 (from A)
are parallel to r in opposite senses and s is parallel to r in the same sense as
p1. Let B and D be arbitrary points on r and s, respectively.

If D is in the region p1q1, the line AD is a transversal. If D is in p1q2,
BD is a transversal. If D is in p2q2, both AD and BD are transversals.
Finally, if D is in p2q1, AB is a transversal.


Hyperbolic geometry will be considered further in Chapters 15, 16, and 20. 


EXERCISES 


1. If p is parallel to s and [prs], then p is parallel to r. (See Figure 15.2c with s for q.)

2. Consider all the points strictly inside a given circle in the Euclidean plane. Re- 
gard all other points as nonexistent. Let chords of the circle be called lines. Then all 
the axioms 12.21-12.27, 12.41, and 12.51 are satisfied. Locate the two rays through 
a given point parallel to a given line. Note that they form an angle (as in Figure 16.2b).


13 


Affine geometry 


The first three sections of this chapter contain a systematic development 
of the foundations of affine geometry. In particular, we shall see how length 
may be measured along a line, though independent units are required for 
lines in different directions. In §§ 13.4-7 we shall investigate such topics 
as area, affine transformations, lattices, vectors, barycentric coordinates, 
and the theorems of Ceva and Menelaus. Finally, in § 13.8 and § 13.9, we 
shall extend these ideas from two dimensions to three. 

According to Blaschke [1, p. 31; 2, p. 12], the word “affine” (German 
affin) was coined by Euler. But it was only after the launching of Klein’s 
Erlangen program (see Chapter 5) that this geometry became recognized as 
a self-contained discipline. Many of the propositions may seem familiar; 
in fact, most readers will discover that they have often been working in the 
affine plane without realizing that it could be so designated. 

Our treatment is somewhat more geometric and less algebraic than that 
of Artin’s Geometric Algebra [Artin 1; see especially pp. 58, 63, 71]. Incidentally,
we shall find that our Axiom 13.12 (which he calls D_P) implies
Theorem 13.122 (his D_a): this presumably means that his Axiom 4b implies
4a. 


13.1 THE AXIOM OF PARALLELISM AND THE “DESARGUES” AXIOM


Mathematical language is difficult but imperishable. I do not believe
that any Greek scholar of to-day can understand the idiomatic under- 
tones of Plato's dialogues, or the jokes of Aristophanes, as thoroughly 
as mathematicians can understand every shade of meaning in Archi- 
medes’ works. 


M. H. A. Newman 
(Mathematical Gazette 43, 1959, p. 167) 


In this axiomatic treatment, we regard the real affine plane as a special 
case of the ordered plane. Accordingly, the primitive concepts are point
and intermediacy, satisfying Axioms 12.21-12.27, 12.41 and 12.51. Affine
geometry is derived from ordered geometry by adding the following two 
extra axioms: 


AXIOM 13.11 For any point A and any line r, not through A, there is at
most one line through A, in the plane Ar, which does not meet r. 


AXIOM 13.12 If A, A’, B, B’, C, C’, O are seven distinct points, such that
AA’, BB’, CC’ are three distinct lines through O, and if the line AB is parallel to 
A’B’, and BC to B’C’, then also CA is parallel to C’A’. 


The affine axiom of parallelism (13.11) combines with 12.62 to tell us that, 
for any point A and any line r, there is exactly one line through A, in the 
plane Ar, which does not meet r. Hence the two rays from A parallel to r 
are always collinear, any two lines in a plane that do not meet are parallel, 
and parallelism is an equivalence relation. The last remark comprises three 
properties: 


Parallelism is reflexive. (Each line is parallel to itself.) 

Parallelism is symmetric. (If p is parallel to r, then r is parallel to p.) 

Parallelism is transitive. (If p and q are parallel to r, then p is parallel to 
q. Euclid I. 30.) 


In the manner that is characteristic of equivalence relations, every line 
belongs to a pencil of parallels whose members are all parallel to one an- 
other. 


Figure 13.1a Figure 13.1b


Axiom 13.12 (see Figure 13.1a) is probably familiar to most readers either 
as a corollary of Euclid VI.2 or as an affine form of Desargues’s theorem. 
We shall see that it implies 


THEOREM 13.121 If ABC and A’B’C’ are two triangles with distinct ver-
tices, so placed that the line BC is parallel to B’C’, CA to C’A’, and AB to
A’B’, then the three lines AA’, BB’, CC’ are either concurrent or parallel.

Proof. If the three lines AA’, BB’, CC’ are not all parallel, some two of
them must meet. The notation being symmetrical, we may suppose that
these two are AA’ and BB’, meeting in O, as in Figure 13.1b. Let OC meet
B’C’ in C1. By Axiom 13.12, applied to AA’, BB’, CC1, the line AC is parallel
to A’C1 as well as to A’C’. By Axiom 13.11, C1 lies on A’C’ as well as
on B’C’. Since A’B’C’ is a triangle, C1 coincides with C’. Thus, if AA’,
BB’, CC’ are not parallel, they are concurrent [Forder 1, p. 158].


Roughly speaking, Axiom 13.12 is the converse of one half of Theorem 
13.121. The converse of the other half is 


THEOREM 13.122 If A, A’, B, B’, C, C’ are six distinct points on three
distinct parallel lines AA’, BB’, CC’, so placed that the line AB is parallel to
A’B’, and BC to B’C’, then also CA is parallel to C’A’.

Proof. Through A’ draw A’C1 parallel to AC, to meet B’C’ in C1, as in
Figure 13.1c. By 13.121, applied to the triangles ABC and A’B’C1, since
AA’ and BB’ are parallel, CC1 is parallel to both of them, and therefore
also to CC’. Hence C1 lies on CC’ as well as on B’C’. Since the parallel
lines BB’ and CC’ are distinct, B’ cannot lie on CC’. Therefore C1 coincides
with C’, and A’C’ is parallel to AC.


Figure 13.1c


EXERCISES 
1. If a line in the plane of two parallel lines meets one of them, it meets the other
also. 


2. Can we always say, of three distinct parallel lines, that one lies between the other 
two? 


13.2 DILATATIONS 


Dilatations . . . are one-to-one maps of the plane onto itself which
move all points of a line into points of a parallel line. 


E. Artin [1, p. 51]


Four non-collinear points A, B, C, D are said to form a parallelogram 
ABCD if the line AB is parallel to DC, and BC to AD. Its vertices are the 
four points; its sides are the four segments AB, BC, CD, DA, and its diag- 
onals are the two segments AC, BD. Since B and D are on opposite sides 
of AC, the diagonals meet in a point called the center [Forder 1, p. 140].

As in § 5.1, we define a dilatation to be a transformation which transforms
each line into a parallel line. But now we must discuss more thoroughly 
the important theorem 5.12, which says that two given segments, AB and
A’B’, on parallel lines, determine a unique dilatation AB → A’B’.


Figure 13.2a


For any point P, not on AB, we can find a corresponding point P’ by
drawing A’P’ parallel to AP, and B’P’ parallel to BP, as in Figure 5.1a. (The
lines thus drawn through A’ and B’ cannot be parallel, for, if they were, AP
and BP would be parallel.) Similarly, another point C yields C’, as in Figure
13.2a. By 13.121, the three lines AA’, BB’, CC’ are either concurrent or
parallel. So likewise are AA’, BB’, PP’.

If the two parallel lines AB and A’B’ do not coincide, it follows that the
four lines AA’, BB’, CC’, PP’ are all either concurrent or parallel. Then,
by 13.12 or 13.122 (respectively), CP and C’P’ are parallel, so that the trans-
formation is indeed a dilatation. If the lines AB and A’B’ do coincide, we
can reach the same conclusion by regarding the transformation as AC → A’C’
instead of AB → A’B’.

We see now that a given dilatation may be specified by its effect on any
given segment. The inverse of the dilatation AB → A’B’ is the dilatation
A’B’ → AB. The product of two dilatations, AB → A’B’ and A’B’ → A″B″,
is the dilatation AB → A″B″. In particular, the product of a dilatation with
its inverse is the identity, AB → AB. Thus all the dilatations together form
a (continuous) group.
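
Anticipating the affine coordinates of § 13.3, these group properties can be checked numerically. The following Python sketch is only an illustration under that assumption: it represents a dilatation as the map P → kP + t on coordinate pairs, and the names Dilatation, from_segments, inverse and then are hypothetical, not notation from the text.

```python
from fractions import Fraction as F

class Dilatation:
    """Coordinate model of a dilatation: P -> k*P + t, with k != 0."""
    def __init__(self, k, t):
        self.k, self.t = F(k), (F(t[0]), F(t[1]))

    def __call__(self, p):
        return (self.k * p[0] + self.t[0], self.k * p[1] + self.t[1])

    def inverse(self):
        # P' = k*P + t  implies  P = (P' - t)/k
        return Dilatation(1 / self.k, (-self.t[0] / self.k, -self.t[1] / self.k))

    def then(self, other):
        # Apply self first, then other: P -> other.k*(k*P + t) + other.t
        return Dilatation(other.k * self.k,
                          (other.k * self.t[0] + other.t[0],
                           other.k * self.t[1] + other.t[1]))

def from_segments(a, b, a2, b2):
    """The dilatation AB -> A'B' for two segments on parallel lines."""
    dx, dy = b[0] - a[0], b[1] - a[1]
    ex, ey = b2[0] - a2[0], b2[1] - a2[1]
    if (dx, dy) == (0, 0) or (ex, ey) == (0, 0):
        raise ValueError("each segment needs two distinct end points")
    if dx * ey - dy * ex != 0:
        raise ValueError("AB and A'B' must lie on parallel lines")
    k = F(ex) / F(dx) if dx != 0 else F(ey) / F(dy)
    return Dilatation(k, (a2[0] - k * a[0], a2[1] - k * a[1]))

d = from_segments((0, 0), (2, 0), (1, 1), (5, 1))   # AB -> A'B'
identity = d.then(d.inverse())                      # AB -> AB
```

Here k = 1 gives a translation, and any other k gives a central dilatation whose invariant point is t/(1 − k).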

The argument used in proving 5.13 shows that, for a given dilatation, the 
lines PP’ which join pairs of corresponding points are invariant lines. The 
discussion of 5.12 shows that all these lines are either concurrent or parallel. 

If the lines PP’ are concurrent, their intersection O is an invariant point,
and we have a central dilatation


OA → OA’


(where A’ lies on the line OA). The invariant point O is unique; for, if O
and O1 were two such, the dilatation would be OO1 → OO1, which is the
identity.


Figure 13.2b


If, on the other hand, the lines PP’ are parallel, there is no invariant point,
and we have a translation AB → A’B’, where not only is AB parallel to A’B’
but also AA’ is parallel to BB’. If these two parallel lines are distinct,
AA’B’B is a parallelogram. If not, we can use auxiliary parallelograms
AA’C’C and C’CBB’ (or AA’D’D and D’DBB’) as in Figure 13.2b. Two ap-
plications of 13.122 suffice to prove that, when A, B, A’ are given, B’ is inde-
pendent of the choice of C (or D). Hence


13.21 Any two points A and A’ determine a unique translation A → A’.


We naturally include, as a degenerate case, the identity, A → A. It follows
that a dilatation, other than the identity, is a translation if and only if it has
no invariant point. Moreover, a given translation may be specified by its
effect on any given point; in fact, the translation A → A’ is the same as
B → B’ if AA’B’B is a parallelogram, or if, for any parallelogram AA’C’C
based on AA’, there is another parallelogram C’CBB’.

We next prove that dilatations are “ordered transformations”:


13.22 The dilatation AB → A’B’ transforms every point between A and B
into a point between A’ and B’.

Proof. If the lines AB and A’B’ are distinct, the fact that [ACB] implies
[A’C’B’] follows at once from 12.401 (for a translation) or 12.402 (for a central
dilatation). To obtain the analogous result for two corresponding triads on
an invariant line CC’, we draw six parallel lines through the six points, as in
Figure 13.2c, and use the fact that [acb] implies [a’c’b’].


To prove Theorem 3.21, which says that the product of two translations is a
translation, we can argue thus: since translations are dilatations, the product
is certainly a dilatation. If it is not a translation it has a unique invariant
point O. If the first of the two given translations takes O to O’, the second
must take O’ back to O. But the translation O’ → O is the inverse of
O → O’. Thus the only case in which the product of two translations has
an invariant point is when one of the translations is the inverse of the other.
(By our convention, the product is still a translation even then.) Hence


13.23 The product of two translations A → B and B → C is the translation
A → C.


Figure 13.2c


To prove that this is a commutative product (as in 3.23), we consider first
the easy case in which the two translations are along nonparallel lines.
Completing the parallelogram ABCD, we observe that the translations
A → B and B → C are the same as D → C and A → D, respectively. Hence
their product in either order is the translation A → C:


(A → B)(B → C) = (A → D)(D → C)
               = (B → C)(A → B).

To deal with the product of two translations T and X along the same line,
let Y be any translation along a nonparallel line, so that X commutes with
both Y and TY. Then

TXY = TYX = XTY


and therefore TX = XT
[cf. Veblen and Young 2, p. 76].


As a special case of 5.12, we see that any two distinct points, A and B, are
interchanged by a unique dilatation AB → BA, or, more concisely,


A ↔ B,


which we call a half-turn. (Of course, A ↔ B is the same as B ↔ A.) If C is
any point outside the line AB, the half-turn transforms C into the point D in
which the line through B parallel to AC meets the line through A paral-
lel to BC (Figure 13.2d). Therefore ADBC is a parallelogram, and the same
half-turn can be expressed as C ↔ D. The invariant lines AB and CD, being
the diagonals of the parallelogram, intersect in a point O, which is the in-
variant point of the half-turn. It follows that any segment AB has a mid-
point which can be defined to be the invariant point of the half-turn A ↔ B,
and we have proved that the center of a parallelogram is the midpoint of each
diagonal, that is, that the two diagonals “bisect” each other. To see how the
half-turn transforms an arbitrary point on AB, we merely have to join this
point to C (or D) and then draw a parallel line through D (or C).

By considering their effect on an arbitrary point B, we may express any
two half-turns as A ↔ B and B ↔ C. If their product has an invariant point
O, each of them must be expressible in the form O ↔ O’, that is, they must
coincide. In every other case, there is no invariant point. Hence


13.24 The product of two half-turns A ↔ B and B ↔ C is the translation
A → C.
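
As a hedged numerical illustration of 13.24, we may again borrow the affine coordinates of § 13.3, in which the half-turn A ↔ B sends P to A + B − P. The function names half_turn, translation and product below are hypothetical helpers chosen for this sketch.

```python
from fractions import Fraction as F

def half_turn(a, b):
    """Half-turn A <-> B in coordinates: P -> A + B - P."""
    sx, sy = F(a[0]) + F(b[0]), F(a[1]) + F(b[1])
    return lambda p: (sx - p[0], sy - p[1])

def translation(a, c):
    """Translation A -> C in coordinates: P -> P + (C - A)."""
    dx, dy = F(c[0]) - F(a[0]), F(c[1]) - F(a[1])
    return lambda p: (p[0] + dx, p[1] + dy)

def product(f, g):
    """Apply f first, then g (the left-to-right order used in the text)."""
    return lambda p: g(f(p))

A, B, C = (0, 0), (4, 2), (1, 5)
two_half_turns = product(half_turn(A, B), half_turn(B, C))
shift = translation(A, C)
samples = [(0, 0), (3, -2), (F(1, 2), 7)]
agree = all(two_half_turns(p) == shift(p) for p in samples)
```

Agreement on a few sample points is of course only a check, not a proof; the proof is the argument given above.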


Figure 13.2d


We have seen (Figure 13.2d) that, if ADBC is a parallelogram, the half-
turn A ↔ B is the same as C ↔ D, and the translation A → D is the same
as C → B. This connection between half-turns and translations remains
valid when the parallelogram collapses to form a symmetrical arrangement
of four collinear points, as in Figure 13.2e:


A C D B


Figure 13.2e 


13.25 The half-turns A ↔ B and C ↔ D are equal if and only if the trans-
lations A → D and C → B are equal.


In fact, the relation (A ↔ B) = (C ↔ D) implies
(A → D) = (A ↔ B)(B ↔ D)
        = (C ↔ D)(D ↔ B) = (C → B)
and, conversely, the relation (A → D) = (C → B) implies
(A ↔ B) = (A → D)(D ↔ B)
        = (C → B)(B ↔ D) = (C ↔ D).


In the special case when C and D coincide, we call them C’ and deduce that
C’ is the midpoint of AB if and only if the translations A → C’ and C’ → B
are equal. This involves the existence of parallelograms AC’A’B’ and
A’B’C’B, as in Figure 13.2f. Completing the parallelogram B’C’A’C, we ob-
tain a triangle ABC with A’, B’, C’ at the midpoints of its sides. Hence


13.26 The line joining the midpoints of two sides of a triangle is parallel to 
the third side, and the line through the midpoint of one side parallel to another 
passes through the midpoint of the third. 


Figure 13.2f 


Two figures are said to be homothetic if they are related by a dilatation,
congruent if they are related by a translation or a half-turn. In particular,
a directed segment AB is congruent to its “opposite” segment BA by the half-
turn A ↔ B. Thus, in Figure 13.2f, the four small triangles

AC’B’, C’BA’, B’A’C, A’B’C’
are all congruent, and each of them is homothetic to the large triangle ABC.


EXERCISES 


1. Such equations as those used in proving 13.25 are easily written down if we re- 
member that each must involve an even number of double-headed arrows (indicating 
half-turns). Explain this rule. 

2. The translations A → C and D → B are equal if the translations A → D and
C → B are equal. (This is obvious when ADBC is a parallelogram, but remarkable
when all the points are collinear.)


3. Setting A = C in the equation
(A ↔ B)(B ↔ C) = (A → C),


deduce that any given point C is the invariant point of a half-turn (C ↔ B)(B ↔ C)
which, by a natural extension of the symbolism, may be written as
C ↔ C.

4. If the three diagonals of a hexagon (not necessarily convex) all have the same
midpoint, any two opposite sides are parallel (as in Figure 4.1e).

5. From any point A1 on the side BC of a triangle ABC, draw A1B1 parallel to BA
to meet CA in B1, then B1C1 parallel to CB to meet AB in C1, and then C1A2 parallel
to AC to meet BC in A2. If A1 is the midpoint of BC, A2 coincides with it. If not,
continue the process, drawing A2B2 parallel to BA, B2C2 parallel to CB, and C2A3
parallel to AC. The path is now closed: A3 coincides with A1. (This is called Thom-
sen’s figure. See Geometrical Magic, by Nev R. Mind, Scripta Mathematica, 19
(1953), pp. 198-200.)


Figure 13.2g

6. The midpoints of the four sides of any simple quadrangle are the vertices of a 
parallelogram (Figure 13.2g; cf. Figure 4.2c). This theorem was discovered by Pierre 
Varignon (1654-1722). It shows that the bimedians, which join the midpoints of op- 
posite sides of the quadrangle, bisect each other. Thus the corollary to Hjelmslev’s 
theorem (§ 3.6) becomes an affine theorem when we replace the hypotheses 3.61 by 


AB = BC, A’B’ = B’C’.
7. The midpoints of the six sides of any complete quadrangle are the vertices of a 
centrally symmetrical hexagon (of the kind considered in Ex. 4, above). 
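
The construction of Exercise 5 above (Thomsen's figure) lends itself to a numerical experiment. The Python sketch below is an illustration only, assuming the coordinates of § 13.3 and exact rational arithmetic; the helper names meet and thomsen_path, and the parameter t locating A1 on BC, are hypothetical.

```python
from fractions import Fraction as F

def meet(p, d, q, r):
    """Intersection of the line through p with direction d and the line qr."""
    ex, ey = r[0] - q[0], r[1] - q[1]
    den = d[0] * ey - d[1] * ex          # zero only if the two lines are parallel
    s = F((q[0] - p[0]) * ey - (q[1] - p[1]) * ex) / den
    return (p[0] + s * d[0], p[1] + s * d[1])

def sub(u, v):
    return (u[0] - v[0], u[1] - v[1])

def thomsen_path(A, B, C, t):
    """Return [A1, B1, C1, A2, B2, C2, A3], where A1 = B + t*(C - B)."""
    P = (B[0] + t * (C[0] - B[0]), B[1] + t * (C[1] - B[1]))
    path = [P]
    # Each leg: (direction to draw along, side of the triangle to be met).
    legs = [(sub(A, B), (C, A)),   # parallel to BA, meeting CA
            (sub(B, C), (A, B)),   # parallel to CB, meeting AB
            (sub(C, A), (B, C))]   # parallel to AC, meeting BC
    for _ in range(2):
        for d, (q, r) in legs:
            P = meet(P, d, q, r)
            path.append(P)
    return path

A, B, C = (0, 0), (5, 1), (2, 6)
path = thomsen_path(A, B, C, F(1, 3))
closed = path[6] == path[0]            # A3 coincides with A1
```

Running the same function with t = 1/2 shows the path closing already at A2, as the exercise asserts for the midpoint.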


13.3 AFFINITIES 


“Yes, indeed,” said the Unicorn, . . . “What can we measure? . . .


We are experts in the theory of measurement, not its practice.”


J. L. Synge [2, p. 51]


The results of § 13.2 may be summarized in the statement that all the trans- 
lations of the affine plane form a continuous Abelian group, which is a sub- 
group of index 2 in the group of translations and half-turns; and the latter is 
a subgroup (of infinite index) in the group of dilatations [Veblen and Young 
2, pp. 79, 93]. 

Moreover, the group of translations is a normal subgroup (or “self-con-
jugate” subgroup)* in the group of dilatations, that is, if T is a translation
while S is a dilatation, then S⁻¹TS is a translation [Artin 1, p. 57]. To
prove this, suppose if possible that the dilatation S⁻¹TS has an invariant
point. Since this invariant point could have been derived from a suitable
point O by applying S, we may denote it by O^S. Thus S⁻¹TS leaves O^S in-
variant. But S⁻¹TS transforms O^S into O^(TS). Hence O^(TS) = O^S. Applying
S⁻¹, we deduce O^T = O, which is absurd (since T has no invariant point).

If T is A → B and S is AB → A^S B^S, then S⁻¹TS is A^S → B^S. Accord-
ingly, it is sometimes convenient to write T^S for S⁻¹TS [see, e.g., Coxeter 1,
p. 39] and to say that the dilatation S transforms the translation T into the
translation T^S. (Since A^S B^S is parallel to AB, T^S has the same direction as
T.) In other words, a dilatation transforms the group of translations into
itself in the manner of an automorphism: if it transforms T into T^S and
another translation U into U^S, it transforms the product TU into (TU)^S
= T^S U^S and any power of T into the same power of T^S.

* Birkhoff and MacLane 1, p. 141; Coxeter 1, p. 42.

It is convenient to use the italic letter T for the point into which the trans-
lation T transforms an arbitrarily chosen initial point (or origin) I. Then, if
a central dilatation S has I as its invariant point, it not only transforms the
translation T into T^S but also transforms the point T into the point T^S.


Figure 13.3a 


Applying to the arbitrary point I all the integral powers of a given transla-
tion X, we obtain a one-dimensional lattice consisting of infinitely many
points “evenly spaced” along a line, as in Figure 13.3a. We may regard
every such point X^μ as being derived from the point X by a dilatation
IX → IX^μ (which leaves the point I invariant). At first we take μ to be an
integer; but since the same dilatation transforms each X^ν into


(X^μ)^ν = X^(μν),


we can consistently extend the meaning of X^μ so as to allow μ to have any
rational value, and finally any real value. In other words, we can interpolate
new points between the points of the one-dimensional lattice and then define
X^μ, for any real μ, to mean the translation I → X^μ. The details are as
follows.

For each rational number μ = a/b (where a is an integer and b is a positive
integer) we derive from the point X a new point X^μ by means of the dilata-
tion IX^b → IX^a. A convenient way to construct this point X^μ is to use the
lattice of powers of an arbitrary translation Y along another line through the
initial point I, drawing a line through the point Y parallel to the join of the
points Y^b and X^a, as in Figure 13.3b (cf. Figure 9.1c).
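
The parallel-line construction just described can be imitated in a few lines of Python. This is a sketch under stated assumptions rather than part of the axiomatic development: the points I, X, Y are given by coordinate pairs in some auxiliary frame, and the function name rational_power_point is hypothetical.

```python
from fractions import Fraction as F

def rational_power_point(I, X, Y, a, b):
    """Point X^(a/b) on the line IX, found as the meet of IX with the line
    through Y parallel to the join of the lattice points Y^b and X^a."""
    Yb = (I[0] + b * (Y[0] - I[0]), I[1] + b * (Y[1] - I[1]))   # point Y^b
    Xa = (I[0] + a * (X[0] - I[0]), I[1] + a * (X[1] - I[1]))   # point X^a
    d = (Xa[0] - Yb[0], Xa[1] - Yb[1])      # direction of the join Y^b X^a
    e = (X[0] - I[0], X[1] - I[1])          # direction of the line IX
    # Solve Y + s*d = I + u*e for u, eliminating s with a cross product.
    den = e[0] * d[1] - e[1] * d[0]
    u = F((Y[0] - I[0]) * d[1] - (Y[1] - I[1]) * d[0]) / den
    return (I[0] + u * e[0], I[1] + u * e[1])

# In the frame I = (0,0), X = (1,0), Y = (0,1) the point X^(3/2) is (3/2, 0).
P = rational_power_point((0, 0), (1, 0), (0, 1), 3, 2)
```

The computed parameter u equals a/b whatever non-collinear triple I, X, Y is chosen, which is the content of the construction.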

To verify that the order of such points X^μ agrees with the order of the ra-
tional numbers μ, we take three of them and reduce their μ’s to a common
denominator so as to express them as X^(a1/b), X^(a2/b), X^(a3/b). If a1 < a2 < a3,
so that [X^a1 X^a2 X^a3], we can apply 13.22 to the dilatation IX^b → IX, with
the conclusion that


[X^(a1/b) X^(a2/b) X^(a3/b)].


If μ is irrational, we define X^μ to be the Dedekind section between all the
rational points X^(a/b) for which a/b < μ and all those for which a/b > μ.
More precisely, supposing for definiteness that μ is positive, we apply the
“ray” version of 12.51 to two sets of points, one consisting of all the points
whose exponents are positive rational numbers less than μ, and all the points
between pairs of these, whereas the other set consists of the rest of the “posi-
tive” ray IX. (If μ is negative, we make the same kind of partition of the
“negative” ray I/X.) Finally X^μ is, by definition, the translation I → X^μ.


Figure 13.3b

We have now interpreted the symbol X^μ for all real values of μ (includ-
ing 0 and 1, which yield X^0 = 1 and X^1 = X). Conversely, every point on
the line IX can be expressed in the form X^μ.

This is obvious for any point of the interval from X⁻¹ to X. Any other
point T satisfies either [I X T] or [I X⁻¹ T]. If [I X T], the dilatation
IT → IX transforms X into a point between I and X, say X^μ. The inverse
dilatation IX^μ → IX transforms X into X^(1/μ); therefore T = X^(1/μ). If, on
the other hand, [I X⁻¹ T], we make the analogous use of IT → IX⁻¹. In
either case we obtain an expression for T as a power of X.

Thus, assuming Dedekind’s axiom, we have proved the “axiom of Archi- 
medes”: 


13.31 For any point T (except I) on the line of a translation X, there is
an integer n such that T lies between the points I and X^n.


The exponent μ provides a measure of distance along the line IX. In fact,
the segment X^ν X^μ (ν < μ) is said to have length μ − ν in terms of the segment
IX as unit:

μ − ν = X^ν X^μ / IX.

Along another line IY (Figure 13.3c) we have an independent unit. Since
the dilatation IX → IX^μ transforms the point Y into Y^μ, where the line X^μ Y^μ
is parallel to XY, we have

IX^μ / IX = IY^μ / IY,

in agreement with Euclid VI.2 (see § 1.3). Thus we can define ratios of the
lengths on one line, or on parallel lines, and we can compare such ratios on
different lines. But affine geometry contains no machinery for comparing
lengths in different directions: it is a meaningless question whether the trans-
lation Y is longer or shorter than X.


Figure 13.3c


The above definition for the length of the segment X^ν X^μ (ν < μ) suggests
the propriety of allowing the oppositely directed segment X^μ X^ν to have the
negative length ν − μ. This convention enables us to write μ = IX^μ/IX for
negative as well as positive values of μ, and to add lengths of collinear seg-
ments according to such formulas as


AB + BC = AC, BC + CA + AB = 0,


regardless of the order of their end points A, B, C.

Now, to set up a system of affine coordinates in the plane, we let (x, y) de-
note the point into which the origin I is transformed by the translation X^x Y^y.
This simple device establishes a one-to-one correspondence between points
in the plane and ordered pairs of real numbers. In particular, the point X^x is
(x, 0), Y^y is (0, y), and the origin itself is (0, 0). When x and y are integers, the
points (x, y) form a two-dimensional lattice, as in Figure 4.1b. The remaining
points (x, y) are distributed between the lattice points in the obvious manner.
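
To make this correspondence concrete, suppose (purely as an assumption for illustration) that I, X, Y are themselves given by pairs of numbers in some auxiliary drawing frame. The Python sketch below converts in both directions; the names affine_coords and point_at are hypothetical.

```python
from fractions import Fraction as F

def affine_coords(I, X, Y, P):
    """Coordinates (x, y) with P = I + x*(X - I) + y*(Y - I), by Cramer's rule."""
    ux, uy = X[0] - I[0], X[1] - I[1]
    vx, vy = Y[0] - I[0], Y[1] - I[1]
    px, py = P[0] - I[0], P[1] - I[1]
    det = ux * vy - uy * vx
    if det == 0:
        raise ValueError("I, X, Y must not be collinear")
    return (F(px * vy - py * vx) / det, F(ux * py - uy * px) / det)

def point_at(I, X, Y, x, y):
    """The point into which I is carried by the translation X^x Y^y."""
    return (I[0] + x * (X[0] - I[0]) + y * (Y[0] - I[0]),
            I[1] + x * (X[1] - I[1]) + y * (Y[1] - I[1]))

I, X, Y = (1, 2), (3, 3), (0, 5)
P = point_at(I, X, Y, F(2), F(-1, 2))
xy = affine_coords(I, X, Y, P)
```

The two functions are inverse to each other, which is the one-to-one correspondence asserted above; no comparison of the units IX and IY is involved.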

In affine coordinates (as in Cartesian coordinates) a line has a linear equa-
tion. The powers of the translation X^(−b) Y^a transform the origin into the
points (−μb, μa) whose locus is the line ax + by = 0. The same powers
transform (x1, y1) into the points


(x1 − μb, y1 + μa)
whose locus is
a(x − x1) + b(y − y1) = 0.
We can thus express a line in any of the standard forms 8.11, 8.12, 8.13.

A dilatation is a special case of an affinity, which is any transformation (of
the whole affine plane onto itself) preserving collinearity. Thus, an affinity
transforms parallel lines into parallel lines, and preserves ratios of distances
along parallel lines. It also preserves intermediacy (compare 13.22).


13.32 An affinity is uniquely determined by its effect on any one triangle.


For, if it transforms a triangle IXY into I’X’Y’, it transforms the point
(x, y) referred to the former triangle into the point having the same coordi-
nates referred to the latter. Here IXY and I’X’Y’ may be any two triangles
[Veblen and Young 2, p. 72], and we naturally speak of “the affinity
IXY → I’X’Y’.” In particular, if ABC and ABC’ are two triangles with a
common side, ABC → ABC’ is called a shear or a strain according as the line
CC’ is or is not parallel to AB. One kind of strain is sufficiently important
to deserve a special name and a special symbol: the affine reflection A(CC’)
or B(CC’), which arises when the midpoint of CC’ lies on AB. In other
words, any triangle ACC’ determines an affine reflection A(CC’) whose
mirror (or “axis”) is the median through A and whose direction is the direc-
tion of all lines parallel to CC’.
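
The argument for 13.32 is effectively an algorithm: find the coordinates of a point referred to the first triangle, then rebuild the point from the second. A minimal Python sketch follows, assuming the triangles are supplied as coordinate pairs in one auxiliary frame; the function name affinity is hypothetical.

```python
from fractions import Fraction as F

def affinity(tri, tri2):
    """Return the map P -> P' for the affinity IXY -> I'X'Y'."""
    (I, X, Y), (I2, X2, Y2) = tri, tri2
    ux, uy = X[0] - I[0], X[1] - I[1]
    vx, vy = Y[0] - I[0], Y[1] - I[1]
    det = ux * vy - uy * vx
    if det == 0:
        raise ValueError("the first triangle is degenerate")

    def apply(P):
        # Coordinates (x, y) of P referred to the triangle IXY ...
        px, py = P[0] - I[0], P[1] - I[1]
        x = F(px * vy - py * vx) / det
        y = F(ux * py - uy * px) / det
        # ... give the point with the same coordinates referred to I'X'Y'.
        return (I2[0] + x * (X2[0] - I2[0]) + y * (Y2[0] - I2[0]),
                I2[1] + x * (X2[1] - I2[1]) + y * (Y2[1] - I2[1]))
    return apply

# A shear ABC -> ABC' (CC' is parallel to AB in this example).
A, B, C, C2 = (0, 0), (1, 0), (0, 1), (2, 1)
shear = affinity((A, B, C), (A, B, C2))
image = shear((1, 1))
```

In this example every point of the line AB stays fixed, while other points slide parallel to AB.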

In the language of the Erlangen program (see page 67), the principal group 
for affine geometry is the group of all affinities. 


EXERCISES 
1. The shear or strain ABC → ABC’ leaves invariant every point on the line AB.
What is its effect on a point P of general position?
2. Every affinity of period 2 is either a half-turn or an affine reflection. 


3. If, for a given affinity, every noninvariant point lies on at least one invariant line, 
then the affinity is either a dilatation or a shear or a strain. 


4. In terms of affine coordinates, affinities are “affine transformations”


13.33    x’ = ax + by + l, y’ = cx + dy + m, ad ≠ bc.
5. Describe the transformations
(i) x’ = x + 1, y’ = y; (ii) x’ = ax, y’ = ay;


(iii) x’ = x + by, y’ = y; (iv) x’ = ax, y’ = y.

13.4 EQUIAFFINITIES 


For he, by Geometrick scale, 
Could take the size of Pots of Ale, 


Samuel Butler (1600-1680)
(Hudibras, I.1)


We are now ready to show how the comparison of lengths on parallel lines
can be extended to yield a comparison of areas in any position [cf. Forder
1, pp. 259-265; Coxeter 2, pp. 125-128]. For simplicity, we restrict con-
sideration to polygonal regions. (Other shapes may be included by a suit-
able limiting process of the kind used in integral calculus.) Clearly, any
polygonal region can be dissected into a finite number of triangles.* Follow-
ing H. Hadwiger and P. Glur [Elemente der Mathematik, 6 (1951), pp. 97-
120], we declare two such regions to be equivalent if they can be dissected 
into a finite number of pieces that are congruent in pairs (by translations or 
by half-turns). In other words, two polygonal regions are equivalent if they 
can be derived from each other by dissection and rearrangement. Super- 
posing two different dissections, we see that this kind of equivalence, which 
is obviously reflexive and symmetric, is also transitive; two polygons that are 
equivalent to the same polygon are equivalent to each other. 


Figure 13.4a Figure 13.4b


The parallelograms OPQR and OP’Q’R of Figure 13.4a are equivalent,
since each of them consists of the trapezoid OP’QR plus one of the two con- 
gruent triangles OPP’, RQQ’. In some such cases, more than two pieces 
may be needed, but we find eventually: 


13.41 Two parallelograms are equivalent if they have one pair of opposite 
sides of the same length lying on the same pair of parallel lines. 


Since a parallelogram can be dissected along a diagonal to make two tri-
angles that are congruent by a half-turn, it follows that two triangles (such as
OPQ and OP’Q’ in Figure 13.4b) are equivalent if they have a common ver-
tex while their sides opposite to this vertex are congruent segments on one
line. In particular, if points P0, P1, ..., Pn are evenly spaced along a line
(not through O), so that the segments P0P1, P1P2, ... are all congruent,
as in Figure 13.4c, then the triangles OP0P1, OP1P2, ... are all equivalent,
and we naturally say that the area of OP0Pn is n times the area of OP0P1.
By interpolation of further points on the same line, we can extend this idea
to all real values of n, with the conclusion that, if Q is on the side PQ’ of a
triangle OPQ’, as in Figure 13.4d, the Cevian OQ divides the area of the tri-
angle in the same ratio that the point Q divides the side:


13.42    OPQ / OPQ’ = PQ / PQ’.

We naturally regard this ratio as being negative if P lies between Q and Q’,
that is, if the two triangles are oppositely oriented.

* N. J. Lennes, American Journal of Mathematics, 33 (1911), p. 46.


Figure 13.4c Figure 13.4d


These ideas enable us to define the area of any polygon in such a way that 
equivalent polygons have the same area, and when two polygons are stuck to- 
gether to make a larger polygon, the areas are added. To compute the area 
of a given polygon in terms of a standard triangle OAB as unit of measure-
ment, we dissect the polygon into triangles and add the areas of the pieces,
each computed as follows. 

By applying a suitable translation, any given triangle can be shifted so that 
one vertex coincides with the vertex O of the standard triangle OAB. Ac- 
cordingly, we consider a triangle OPQ. Let the line PQ meet OA in P’, and 
OB in Q’, as in Figure 13.4d. Multiplying together the three ratios 


OPQ / OP’Q’ = PQ / P’Q’,   OP’Q’ / OAQ’ = OP’ / OA,   OAQ’ / OAB = OQ’ / OB,


we obtain the desired ratio


13.43    OPQ / OAB = (PQ / P’Q’) (OP’ / OA) (OQ’ / OB).

To obtain an analytic expression for the area of a triangle OPQ, referred
to axes through the vertex O, we take the coordinates of the points


O, A, B, P, Q, P’, Q’
to be


(0, 0), (1, 0), (0, 1), (x1, y1), (x2, y2), (p, 0), (0, q),
respectively. Since the equation

x/p + y/q = 1

for the line PQ is satisfied by (x1, y1) and (x2, y2), we have

(1 − y1/q) / x1 = 1/p = (1 − y2/q) / x2,

whence

q = (x1 y2 − x2 y1) / (x1 − x2).

Taking the product of

PQ / P’Q’ = (x1 − x2) / p,   OP’ / OA = p,   OQ’ / OB = q = (x1 y2 − x2 y1) / (x1 − x2),

we obtain

13.44    OPQ / OAB = x1 y2 − x2 y1 = | x1  y1 |
                                      | x2  y2 |.

We deduce, as in § 8.2, that a triangle
(x1, y1)(x2, y2)(x3, y3),


of general position, has area PQR, where


                     | x1  y1  1 |
13.45    PQR / OAB = | x2  y2  1 |.
                     | x3  y3  1 |

Since the homogeneous linear transformation
x’ = ax + by, y’ = cx + dy
takes the triangle OAB to
(0, 0)(a, c)(b, d),
we conclude that the affinity 13.33 preserves area if and only if


ad − bc = 1.
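
As a hedged check of this criterion, the sketch below applies a transformation of the form 13.33 to a triangle and compares signed areas. The names affinity_13_33 and area_factor are hypothetical, and the area function simply restates the determinant 13.45.

```python
from fractions import Fraction as F

def area(P, Q, R):
    """Signed ratio PQR/OAB, as in 13.45."""
    (x1, y1), (x2, y2), (x3, y3) = P, Q, R
    return x1 * (y2 - y3) - y1 * (x2 - x3) + (x2 * y3 - x3 * y2)

def affinity_13_33(a, b, l, c, d, m):
    """x' = a*x + b*y + l, y' = c*x + d*y + m, with ad != bc."""
    if a * d == b * c:
        raise ValueError("13.33 requires ad != bc")
    return lambda p: (a * p[0] + b * p[1] + l, c * p[0] + d * p[1] + m)

def area_factor(f, tri):
    """Ratio of the area of the image triangle to that of the original."""
    return F(area(*[f(p) for p in tri])) / area(*tri)

tri = ((1, 1), (4, 2), (2, 5))
equi = affinity_13_33(2, 3, 5, 1, 2, -7)     # ad - bc = 1
stretch = affinity_13_33(2, 0, 0, 0, 3, 0)   # ad - bc = 6
factors = (area_factor(equi, tri), area_factor(stretch, tri))
```

The factor is ad − bc for every non-degenerate triangle, so it equals 1 exactly for the equiaffinities.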


An area-preserving affinity is called an equiaffinity (or “equiaffine collinea- 
tion” [Veblen and Young 2, pp. 105-113]). The group of all equiaffinities, 
like the group of all dilatations, includes the group of all translations and 
half-turns as a normal subgroup, and is itself a normal subgroup in the group 
of all affinities. Equiaffinities are of many kinds. Here are some examples: 


The hyperbolic rotation (“Lorentz transformation” or “Procrustean stretch”)


13.46    $x' = \mu^{-1}x, \qquad y' = \mu y \qquad (\mu > 0,\ \mu \neq 1),$


for which $x'y' = xy$, leaves invariant each branch of the hyperbola $xy = 1$.
The crossed hyperbolic rotation


13.47    $x' = -\mu^{-1}x, \qquad y' = -\mu y \qquad (\mu > 0,\ \mu \neq 1)$


interchanges the two branches. The parabolic rotation


13.48    $x' = x + 1, \qquad y' = 2x + y + 1,$


for which $x'^2 - y' = x^2 - y$, leaves invariant the parabola $y = x^2$. The
elliptic rotation


13.49    $x' = x\cos\theta - y\sin\theta, \qquad y' = x\sin\theta + y\cos\theta$


leaves invariant the ellipse $x^2 + y^2 = 1$, and is periodic if $\theta$ is commensurable
with $\pi$.
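The invariants claimed for 13.46 to 13.49 are easy to test on sample points. In the sketch below the function names and sample values are illustrative choices, not part of the text; exact rational arithmetic is used where possible and floating point for the elliptic rotation.

```python
import math
from fractions import Fraction

def hyperbolic(mu, point):
    """13.46: x' = x/mu, y' = mu*y."""
    x, y = point
    return (x / mu, mu * y)

def crossed_hyperbolic(mu, point):
    """13.47: x' = -x/mu, y' = -mu*y."""
    x, y = point
    return (-x / mu, -mu * y)

def parabolic(point):
    """13.48: x' = x + 1, y' = 2x + y + 1."""
    x, y = point
    return (x + 1, 2 * x + y + 1)

def elliptic(theta, point):
    """13.49: rotation of the affine coordinates through theta."""
    x, y = point
    return (x * math.cos(theta) - y * math.sin(theta),
            x * math.sin(theta) + y * math.cos(theta))

mu = Fraction(3)
p = (Fraction(2), Fraction(1, 2))          # a point on the branch x > 0 of xy = 1
hx, hy = hyperbolic(mu, p)
cx, cy = crossed_hyperbolic(mu, p)
print(hx * hy, hx > 0)                     # same branch
print(cx * cy, cx < 0)                     # other branch
print(parabolic((3, 9)))                   # a point of y = x^2 goes to another one
```

In each case the linear part has determinant 1, in agreement with the condition ad - bc = 1.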


In § 2.8 (page 36) we derived a regular polygon $P_0P_1P_2\ldots$ from a point
$P_0$ (other than the center) by repeated application of a rotation through
$2\pi/n$. (The rotation takes $P_0$ to $P_1$, $P_1$ to $P_2$, and so on.) Although
measurement of angles has no meaning in affine geometry, we can define an
affinely regular polygon whose vertices $P_j$ are derived from a suitable point $P_0$
by repeated application of an equiaffinity. The polygon is said to be of
type {n} if the equiaffinity is an elliptic rotation 13.49, where $\theta = 2\pi/n$ and
n is rational, so that $P_j$ has affine coordinates


$(\cos j\theta,\ \sin j\theta) \qquad (\theta = 2\pi/n).$


Figure 13.4e shows an affinely regular pentagram (n = 5/2) and pentagon
(n = 5).
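To see the definition at work, one can list the vertices of an affinely regular polygon in an arbitrary affine frame. This is a sketch under stated assumptions: the function name `affinely_regular_polygon` and the basis vectors `e`, `f` are hypothetical, and the frame is expressed in ordinary Cartesian coordinates only so that the points can be printed or plotted.

```python
import math

def affinely_regular_polygon(n, count, e=(1.0, 0.0), f=(0.0, 1.0)):
    """Vertices P_j = cos(j*theta) e + sin(j*theta) f with theta = 2*pi/n, j = 0..count-1."""
    theta = 2 * math.pi / n
    return [(math.cos(j * theta) * e[0] + math.sin(j * theta) * f[0],
             math.cos(j * theta) * e[1] + math.sin(j * theta) * f[1])
            for j in range(count)]

def cross(p, q):
    """Twice the signed area of the triangle O p q."""
    return p[0] * q[1] - p[1] * q[0]

# Pentagram (n = 5/2) in a skew frame; six vertices are listed so that P_5 can be
# compared with P_0.
star = affinely_regular_polygon(5 / 2, 6, e=(2.0, 0.0), f=(1.0, 1.0))
areas = [cross(star[j], star[j + 1]) for j in range(5)]
print(star[0], star[5])
print(areas)
```

The equal areas of the triangles $OP_jP_{j+1}$ reflect the fact that the generating transformation is an equiaffinity.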


Figure 13.4e        Figure 13.4f


EXERCISES 

1. Two triangles with a common side (such as ABC and BCD in Figure 13.2d) 
have the same area if and only if the line joining their remaining vertices is parallel to 
the common side (that is, AD parallel to BC). 

2. If a pentagon has four of its diagonals parallel to four of its sides, the remaining
diagonal is parallel to the remaining side. 

3. When is a dilatation an equiaffinity? 

4. When is a shear an equiaffinity? 

5. When is a strain an equiaffinity? 

6. The product of any even number of affine reflections is an equiaffinity.

7. Any translation or half-turn or shear can be expressed as the product of two
affine reflections.

8. If an equiaffinity is neither a translation nor a half-turn nor a shear, it can be
expressed as $P_0P_1P_2 \to P_1P_2P_3$ where $P_0P_3$ is parallel to $P_1P_2$. (See Figure 13.4f.)

9. Every equiaffinity can be expressed as the product of two affine reflections.
(Veblen.)

10. In an affinely regular polygon $P_0P_1P_2\ldots$, the lines $P_iP_j$ and $P_hP_k$ are parallel
whenever $i + j = h + k$.

11. Why did we call $x^2 + y^2 = 1$ an ellipse rather than a circle (just below 13.49)?

12. What triangles and quadrangles are affinely regular?

13. Construct an affinely regular hexagon.

14. Compute the ratio $P_0P_3/P_1P_2$ for an affinely regular polygon of type {n}.

15. For which values of n can an affinely regular polygon of type {n} be constructed
with a parallel-ruler?


13.5 TWO-DIMENSIONAL LATTICES 


Farey has a notice of twenty lines in the Dictionary of National Biog- 
raphy... . His biographer does not mention the one thing in his life 
which survives. 


G. H. Hardy 
[Hardy and Wright 1, p. 37] 


Our treatment of lattices in § 4.1 (as far as the description of Figure 4.1d) 
is purely affine. In fact, a lattice is the set of points whose affine coordi- 
nates are integers. Any one of the points will serve as the origin O. 

Let A’ be any lattice point, and A the first lattice point along the ray OA’. 
Following Hardy and Wright [1, p. 29], we call A a visible point, because 
there is no lattice point between O and A to hide A from an observer at O. 
In terms of affine coordinates, a necessary and sufficient condition for (x, y) 
to be visible is that the integers x and y be coprime, that is, that they have 
no common divisor greater than 1. The three visible points 


(1, 0), (1, 1), (0, 1)


form with the origin a parallelogram. This is called a unit cell (or “typical 
parallelogram”) of the lattice, because the translations transform it into in- 
finitely many such cells filling the plane without overlapping and without 
interstices: it is a fundamental region for the group of translations. Thus 
it serves as a convenient unit for computing the area of a region. 

According to Steinhaus [2, pp. 76-77, 260] it was G. Pick, in 1899, who 
discovered the following theorem:* 


* For an extension to three dimensions, see J. E. Reeve, On the volume of lattice polyhedra,
Proceedings of the London Mathematical Society (3), 7 (1957), pp. 378-395.

Figure 13.5a


13.51 The area of any simple polygon whose vertices are lattice points is
given by the formula


$\tfrac{1}{2}b + c - 1,$


where b is the number of lattice points on the boundary while c is the number of 
lattice points inside. 

(By a “simple” polygon we mean one whose sides do not cross one an- 
other. Figure 13.5a shows an example in which b = 11, c = 3.) 

Proof. We first observe that the expression $\tfrac{1}{2}b + c - 1$ is additive when
two polygons are juxtaposed. In fact, if two polygons, involving $b_1 + c_1$ and
$b_2 + c_2$ lattice points respectively, have a common side containing $n$ ($\geq 0$)
lattice points in addition to the two vertices at its ends, then the values of
b and c for the combined polygon are


$b = b_1 + b_2 - 2n - 2, \qquad c = c_1 + c_2 + n,$
so that
$\tfrac{1}{2}b + c - 1 = (\tfrac{1}{2}b_1 + c_1 - 1) + (\tfrac{1}{2}b_2 + c_2 - 1).$


Next, the formula holds for a parallelogram having no lattice points on 
its sides (so that b = 4 and the expression reduces to c + 1). For, when
N such parallelograms are fitted together, four at each vertex, to fill a large 
region, the number of lattice points involved (apart from a negligible pe- 
ripheral error) is N(c + 1), and this must be the same as the number of unit 
cells needed to fill the same region. 

Splitting the parallelogram into two congruent triangles by means of a 
diagonal, we see that the formula holds also for a triangle having no lattice 
points on its sides. A triangle that does have lattice points on a side can 
be dealt with by joining such points to the opposite vertex so as to split the 
triangle into smaller triangles. This procedure may have to be repeated, 
but obviously only a finite number of times. Finally, as we remarked on 
page 204, any given polygon can be dissected into triangles; then the ex- 
pressions for those pieces can be added to give the desired result. 
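Pick's formula 13.51 can be compared with a direct area computation on small examples. The Python sketch below is an illustration only: all function names are hypothetical, the interior points are counted by brute force over the bounding box, and the polygon is assumed to be simple with integer vertices.

```python
from fractions import Fraction
from math import gcd

def shoelace_area(poly):
    """Area in unit cells, obtained by adding determinants 13.44 over the sides."""
    n = len(poly)
    twice = sum(poly[i][0] * poly[(i + 1) % n][1] - poly[(i + 1) % n][0] * poly[i][1]
                for i in range(n))
    return Fraction(abs(twice), 2)

def boundary_count(poly):
    """Number b of lattice points on the boundary (gcd count on each side)."""
    n = len(poly)
    return sum(gcd(abs(poly[(i + 1) % n][0] - poly[i][0]),
                   abs(poly[(i + 1) % n][1] - poly[i][1])) for i in range(n))

def on_segment(p, a, b):
    cross = (b[0] - a[0]) * (p[1] - a[1]) - (b[1] - a[1]) * (p[0] - a[0])
    return (cross == 0 and min(a[0], b[0]) <= p[0] <= max(a[0], b[0])
            and min(a[1], b[1]) <= p[1] <= max(a[1], b[1]))

def strictly_inside(p, poly):
    """Exact ray-casting test; points on the boundary are not counted."""
    n = len(poly)
    if any(on_segment(p, poly[i], poly[(i + 1) % n]) for i in range(n)):
        return False
    inside = False
    for i in range(n):
        (x1, y1), (x2, y2) = poly[i], poly[(i + 1) % n]
        if (y1 > p[1]) != (y2 > p[1]):
            # Sign test for "p lies to the left of the side at height p[1]".
            t = (p[1] - y1) * (x2 - x1) - (p[0] - x1) * (y2 - y1)
            if (t > 0) == (y2 > y1):
                inside = not inside
    return inside

def interior_count(poly):
    """Number c of lattice points strictly inside the polygon."""
    xs, ys = [v[0] for v in poly], [v[1] for v in poly]
    return sum(strictly_inside((x, y), poly)
               for x in range(min(xs), max(xs) + 1)
               for y in range(min(ys), max(ys) + 1))

def pick_area(poly):
    return Fraction(boundary_count(poly), 2) + interior_count(poly) - 1

triangle = [(0, 0), (4, 0), (0, 4)]
l_shape = [(0, 0), (3, 0), (3, 1), (1, 1), (1, 3), (0, 3)]
for poly in (triangle, l_shape):
    print(boundary_count(poly), interior_count(poly), pick_area(poly), shoelace_area(poly))
```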


In particular, any parallelogram for which b = 4 and c = 0 has area 1
and can serve as a unit cell. If the vertices of such a parallelogram (in
counterclockwise order) are


$(0, 0),\ (x, y),\ (x + x_1,\ y + y_1),\ (x_1, y_1),$
we see from 13.44 that


13.52    $x y_1 - y x_1 = 1.$
In other words, this is the condition for the points
13.53    $(0, 0),\ (x, y),\ (x_1, y_1)$


to form a positively oriented “empty” triangle of area $\tfrac{1}{2}$, which could be used
just as well as (0, 0) (1, 0) (0, 1) to generate the lattice. Thus a lattice is
completely determined, apart from its position, by the area of its unit cell.
Moreover, although there are infinitely many visible points in a given lattice, they
all play the same role. (These properties of affine geometry are in marked
contrast to Euclidean geometry, where the shape of a lattice admits unlimited
variation and each lattice contains visible points at infinitely many
different distances.)

Figure 13.5b


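George Pólya* has applied 13.52 to a useful lemma in the theory of numbers.
The Farey series $F_n$ of order n is the ascending sequence of fractions
from 0 to 1 whose denominators do not exceed n. Thus y/x belongs to $F_n$
if x and y are coprime and


13.54    $0 \leq y \leq x \leq n.$


For instance, $F_5$ is


$\tfrac{0}{1},\ \tfrac{1}{5},\ \tfrac{1}{4},\ \tfrac{1}{3},\ \tfrac{2}{5},\ \tfrac{1}{2},\ \tfrac{3}{5},\ \tfrac{2}{3},\ \tfrac{3}{4},\ \tfrac{4}{5},\ \tfrac{1}{1}.$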


* Acta Litterarum ac Scientiarum Regiae Universitatis Hungaricae Francisco-Josephinae, Sectio

The essential property of such a sequence, from which many other properties
follow by simple algebra, is that 13.52 holds for any two adjacent fractions


$\dfrac{y}{x}$ and $\dfrac{y_1}{x_1}.$

To prove this, we represent each term y/x of the sequence by the point (x, y)
of a lattice. For example, the terms of $F_5$ are the lattice points emphasized
in Figure 13.5b (where, for convenience, the angle between the axes is obtuse).
Since the fractions are in their “lowest terms,” the points are visible.
By 13.54, they belong to the triangle (0, 0) (n, 0) (n, n). A ray from the origin,
rotated counterclockwise, passes through the representative points in
their proper order. If $y/x$ and $y_1/x_1$ are consecutive terms of the sequence,
then $(x, y)$ and $(x_1, y_1)$ are visible points such that the triangle joining them
to the origin contains no lattice point in its interior. Hence this triangle
is one half of a unit cell, and 13.52 holds, as required.
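The property just proved can be observed directly by generating a Farey series and forming the determinant of 13.52 for each pair of neighbours. The helper names `farey` and `adjacent_determinants` below are hypothetical; the sketch relies on Python's `Fraction`, which keeps every fraction in lowest terms.

```python
from fractions import Fraction

def farey(n):
    """Ascending Farey series F_n: all y/x in lowest terms with 0 <= y <= x <= n."""
    return sorted({Fraction(y, x) for x in range(1, n + 1) for y in range(x + 1)})

def adjacent_determinants(series):
    """Values x*y1 - y*x1 for consecutive terms y/x and y1/x1 (13.52 predicts 1)."""
    return [a.denominator * b.numerator - a.numerator * b.denominator
            for a, b in zip(series, series[1:])]

f5 = farey(5)
print(", ".join(f"{t.numerator}/{t.denominator}" for t in f5))
print(set(adjacent_determinants(f5)))
```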


Figure 13.5c


Another result belonging to affine geometry is


13.55 If the sides BC, CA, AB of a triangle ABC are divided at L, M, N
in the respective ratios $\lambda : 1$, $\mu : 1$, $\nu : 1$, the cevians AL, BM, CN form a
triangle whose area is


$\dfrac{(\lambda\mu\nu - 1)^2}{(\lambda\mu + \lambda + 1)(\mu\nu + \mu + 1)(\nu\lambda + \nu + 1)}$
times that of ABC.


This was discovered by Routh [1, p. 82; see also Dorrie 1, pp. 41-42]. We
shall give a general proof in § 13.7, but it is interesting to observe that, when
$\lambda = \mu = \nu$, so that the ratio of areas is $(\lambda - 1)^3/(\lambda^3 - 1)$, the result can
be deduced from 13.51. For instance, when $\lambda = \mu = \nu = 2$, so that each
side is trisected [Steinhaus 2, p. 8], the central triangle is one-seventh of
the whole, and we can see this immediately by embedding the figure in a
lattice, as in Figure 13.5c. Since the central triangle has b = 3, c = 0 while
ABC has b = 3, c = 3, the ratio of areas is $\tfrac{1}{2} / \tfrac{7}{2} = \tfrac{1}{7}$.


EXERCISES

1. If $y/x$ and $y_1/x_1$ are two consecutive terms of a Farey series, $x$ and $x_1$ are
coprime.

2. If $y_0/x_0$, $y/x$, $y_1/x_1$ are three consecutive terms of a Farey series,


$\dfrac{y_0 + y_1}{x_0 + x_1} = \dfrac{y}{x}.$
(C. Haros, 1802.)

3. The points A, B, C in Figure 13.5c belong to a lattice whose unit cell has seven
times the area of that of the basic lattice. (For the Euclidean theory of such
compound lattices, see Coxeter, Configurations and maps, Reports of a Mathematical
Colloquium (2), 8 (1948), pp. 18-38, especially Figs. i, v, vii.)

4. Use lattices to verify 13.55 when (a) $\lambda = \mu = \nu = 3$, (b) $\lambda = \mu = \nu =$ [?].

5. Join the vertices A, B, C, D of a parallelogram to the midpoints of the respec- 
tive sides BC, CD, DA, AB so as to form a smaller parallelogram in the middle. Its 
area is one-fifth that of ABCD. Another such parallelogram is obtained by joining 
A, B,C, D to the midpoints of CD, DA, AB, BC. The common part of these two small 
parallelograms is a centrally symmetrical octagon whose area is one-sixth that of ABCD 
[Dorrie 1, p. 40]. 

6. In the notation of 13.55, the area of the triangle LMN is

$\dfrac{\lambda\mu\nu + 1}{(\lambda + 1)(\mu + 1)(\nu + 1)}$
times that of ABC. (Hint: Use 13.42 to compute the relative area of CLM, etc.)


7. Of the four triangles ANM, BLN, CML, LMN, the last cannot have the smallest
area unless L, M, N are the midpoints of BC, CA, AB. (H. Debrunner.*)


13.6 VECTORS AND CENTROIDS 


A vector is really the same thing as a translation, although one uses
different phraseologies for vectors and translations. Instead of speaking
of the translation $A \to A'$ which carries the point A into $A'$ one
speaks of the vector $\overrightarrow{AA'}$. . . . The same vector laid off from B ends
in $B'$ if the translation carrying A into $A'$ carries B into $B'$.


H. Weyl [1, p. 45]


As we saw in § 2.5, a group is an associative system containing an identity
and, for each element, an inverse. Arithmetical instances are provided
by the positive rational numbers, the positive real numbers, the complex
numbers of modulus 1, and all the complex numbers except 0, combined,
in each case, by ordinary multiplication. Such instances make it natural to
adopt a multiplicative notation for all groups, so that the combination of S
and T is ST, the inverse of S is $S^{-1}$, and the identity is 1. However, it is
often convenient, especially in the case of Abelian (i.e., commutative)
groups, to use instead the additive notation, in which the combination of
S and T is S + T, the inverse of S is $-S$, and the identity is 0. To see
that this other notation has equally simple arithmetical instances, we merely
have to consider in turn the integers, the rational numbers, the real numbers,
and the complex numbers, combined, in each case, by ordinary addition.

* Elemente der Mathematik, 12 (1957), p. 43, Aufgabe 260.

The transition from a multiplicative group to the corresponding additive 
group is the foundation of the theory of logarithms [Infeld 1, pp. 97-100]. 

When we go outside the domain of arithmetic, the choice between mul- 
tiplication and addition is merely a matter of notation. In particular, the 
Abelian group of translations, which we have expressed as a multiplicative 
group, becomes the additive group of vectors. 


In this notation, 13.21 asserts that any two points A and $A'$ determine a
unique vector $\overrightarrow{AA'}$ (going from A to $A'$). Figure 13.2d illustrates a situation
in which


$\overrightarrow{AA'} = \overrightarrow{CC'} = \overrightarrow{BB'},$


and 13.23 asserts that, for any two vectors a and b,


$\mathbf{a} + \mathbf{b} = \mathbf{b} + \mathbf{a}.$


In the same spirit the “origin” will henceforth be called O instead of /, and 
the zero vector will be denoted by 0. The integral multiples of any non- 
zero vector proceed from the origin to the points of a one-dimensional lat- 
tice. Two vectors e and f are said to be independent if neither is a (real) 
multiple of the other, that is, if the only numbers that satisfy the vector 
equation 

$x\mathbf{e} + y\mathbf{f} = \mathbf{0}$


are x = 0 and y = 0. Two such vectors (corresponding to the translations
X and Y in Figure 4.1c) provide a basis for a system of affine coordinates:
they enable us to define the coordinates of any point to be the coefficients
in the expression

$x\mathbf{e} + y\mathbf{f}$


for the position vector which goes from the origin to the given point. In
other words, with reference to a triangle OAB, the affine coordinates of a
point P are the coefficients in the expression


$\overrightarrow{OP} = x\,\overrightarrow{OA} + y\,\overrightarrow{OB}.$


We shall find it useful to borrow from statics the notion of the centroid
(or “center of gravity”) of a set of “weighted” points, that is, of points to
each of which a real number is attached in a special way. For convenience,
we shall call these numbers masses, although, when some of them are negative,
electric charges provide a more appropriate illustration.

Let masses $t_1, \ldots, t_k$ be assigned to k distinct points $A_1, \ldots, A_k$, let O be
any point (possibly coincident with one of the A's), and consider the vector


$t_1\,\overrightarrow{OA_1} + \ldots + t_k\,\overrightarrow{OA_k}.$


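If $t_1 + \ldots + t_k = 0$, this vector is independent of the choice of O. For,
if we subtract from it the result of using $O'$ instead, we obtain


$t_1(\overrightarrow{OA_1} - \overrightarrow{O'A_1}) + \ldots + t_k(\overrightarrow{OA_k} - \overrightarrow{O'A_k}) = (t_1 + \ldots + t_k)\,\overrightarrow{OO'} = \mathbf{0}.$
More interestingly, if
$t_1 + \ldots + t_k \neq 0,$


we have


$t_1\,\overrightarrow{OA_1} + \ldots + t_k\,\overrightarrow{OA_k} = (t_1 + \ldots + t_k)\,\overrightarrow{OP},$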


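where the point P is independent of the choice of O. For, if the same procedure
with $O'$ instead of O yields $P'$ instead of P, we have, by subtraction,


$(t_1 + \ldots + t_k)\,\overrightarrow{OO'} = (t_1 + \ldots + t_k)(\overrightarrow{OP} - \overrightarrow{O'P'}),$


whence $\overrightarrow{OP'} = \overrightarrow{OO'} + \overrightarrow{O'P'} = \overrightarrow{OP}$,
so that $P'$ coincides with P. This point P, given by


$\Sigma t_i\,\overrightarrow{OP} = \Sigma t_i\,\overrightarrow{OA_i},$


is called the centroid (or “barycenter”) of the k masses $t_i$ at $A_i$.
Since, having found P, we may choose this position for O, we have


$\Sigma t_i\,\overrightarrow{PA_i} = \mathbf{0}.$
If there are only two points,
$t_1\,\overrightarrow{PA_1} = -t_2\,\overrightarrow{PA_2},$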

so that P lies on the line $A_1A_2$ and divides the segment $A_1A_2$ in the ratio
$t_2 : t_1$. In particular, if $t_1 = t_2$, P is the midpoint of $A_1A_2$.
For a triangle $A_1A_2A_3$, we have

$(t_1 + t_2 + t_3)\,\overrightarrow{OP} = t_1\,\overrightarrow{OA_1} + t_2\,\overrightarrow{OA_2} + t_3\,\overrightarrow{OA_3} = t_1\,\overrightarrow{OA_1} + (t_2 + t_3)\,\overrightarrow{OQ},$


where Q is the centroid of $t_2$ at $A_2$ and $t_3$ at $A_3$. Thus, in seeking the centroid
of three masses, we may replace two of them by their combined mass at their
own centroid. (There is an obvious generalization to more than three
masses.) In particular, when $t_1 = t_2 = t_3$ (= 1, say), Q is the midpoint of
$A_2A_3$, and P divides $A_1Q$ in the ratio 2 : 1. Thus the “centroid” G of a
triangle (§ 1.4) is the centroid of equal masses at its three vertices.
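The statements above (independence of the origin, the ratio $t_2 : t_1$ for two masses, and the replacement of two masses by their combined mass) can be tried out with exact arithmetic. In this sketch the function name `centroid` and its `origin` parameter are illustrative assumptions; points are given by affine coordinates in the plane.

```python
from fractions import Fraction

def centroid(masses, points, origin=(0, 0)):
    """Point P with (sum t_i) OP = sum t_i OA_i, computed from an arbitrary origin O."""
    total = sum(masses)
    if total == 0:
        raise ValueError("the masses sum to zero, so no centroid is determined")
    ox, oy = origin
    sx = sum(t * (x - ox) for t, (x, y) in zip(masses, points))
    sy = sum(t * (y - oy) for t, (x, y) in zip(masses, points))
    return (ox + Fraction(sx, total), oy + Fraction(sy, total))

A1, A2, A3 = (0, 0), (6, 0), (0, 3)

# The same point is obtained from two different origins.
print(centroid([1, 1, 1], [A1, A2, A3]), centroid([1, 1, 1], [A1, A2, A3], origin=(5, -7)))

# Two masses: P divides the segment in the ratio t2 : t1.
print(centroid([1, 3], [(0, 0), (4, 0)]))

# Replacing the masses at A2 and A3 by their combined mass at their own centroid Q.
Q = centroid([2, 3], [A2, A3])
print(centroid([1, 5], [A1, Q]), centroid([1, 2, 3], [A1, A2, A3]))
```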


Figure 13.6a


This same point G, where the medians concur, is also the centroid of a
triangular lamina or “plate” of uniform density. (Strictly speaking, this
notion requires integral calculus.) For we may divide the triangle into thin
strips parallel to the side $A_2A_3$, as in Figure 13.6a. The centroids of these
strips evidently lie on the median $A_1Q$. Hence the centroid of the whole
lamina lies on this median, and similarly on the others. (This argument was
used by Archimedes in the third century B.C.)


EXERCISES 


1. Verify in detail that 
(i) the positive rational numbers, 
(ii) the positive real numbers, 
(iii) the complex numbers of modulus 1, 
(iv) all the complex numbers except 0 
form multiplicative groups; and that 
(v) the integers, 
(vi) the rational numbers, 
(vii) the real numbers, 
(viii) the complex numbers
form additive groups. Explain why the first four sets do not form additive groups, and
why the last four do not form multiplicative groups.

2. If A, B, C are on one line and $A'$, $B'$, $C'$ on another with


$\dfrac{AB}{A'B'} = \dfrac{BC}{B'C'},$


then points dividing all the segments $AA'$, $BB'$, $CC'$ in the same ratio are either
collinear or coincident (cf. § 3.6). (Hint: Consider the centroid of suitable masses at
A, C, $A'$, $C'$.)

3. The centroid of equal masses at the vertices of a quadrangle is the center of the 
Varignon parallelogram (Figure 13.22). 


Figure 13.6b 


4. The centroid of a quadrangular lamina is the center of the Wittenbauer
parallelogram, whose sides join adjacent points of trisection of the sides, as in Figure 13.6b.
This theorem, due to F. Wittenbauer (1857-1922) [Blaschke 2, p. 13], was rediscov- 
ered by J. J. Welch and V. W. Foss.* 

5. For what kind of quadrangle will the centroids described in the two preceding 
exercises coincide? 


13.7 BARYCENTRIC COORDINATES 


If $t_1 + t_2 \neq 0$, masses $t_1$ and $t_2$ at two fixed points $A_1$ and $A_2$ determine
a unique centroid P, as in Figure 13.7a. This point is $A_1$ itself if $t_2 = 0$, $A_2$
if $t_1 = 0$. It is on the segment $A_1A_2$ if the t's are both positive (or both
negative), on the ray $A_1/A_2$ if

$t_1 > -t_2 > 0,$
and on the ray $A_2/A_1$ if $t_2 > -t_1 > 0$.


* Mathematical Gazette, 42 (1958), p. 55; 43 (1959), p. 46.

Figure 13.7a


Conversely, given a point P on the line $A_1A_2$, we can find numbers $t_1$ and
$t_2$ such that

$\dfrac{t_2}{t_1} = \dfrac{A_1P}{PA_2} \quad\text{or}\quad \dfrac{t_1}{t_2} = \dfrac{PA_2}{A_1P};$


then P will be the centroid of masses $t_1$ and $t_2$ at $A_1$ and $A_2$. Since masses
$\mu t_1$ and $\mu t_2$ (where $\mu \neq 0$) determine the same point as $t_1$ and $t_2$, these
barycentric coordinates are homogeneous:


$(t_1, t_2) = (\mu t_1, \mu t_2) \qquad (\mu \neq 0).$


Similarly, as Möbius observed in 1827, we may set up barycentric coordinates
in the plane of a triangle of reference $A_1A_2A_3$. If $t_1 + t_2 + t_3 \neq 0$,
masses $t_1, t_2, t_3$ at the three vertices determine a point P (the centroid) whose
coordinates are $(t_1, t_2, t_3)$. In particular, (1, 0, 0) is $A_1$, (0, 1, 0) is $A_2$,
(0, 0, 1) is $A_3$, and $(0, t_2, t_3)$ is the point on $A_2A_3$ whose one-dimensional
coordinates with respect to $A_2$ and $A_3$ are $(t_2, t_3)$. To find coordinates for a
given point P of general position, we find $t_2$ and $t_3$ from such a point Q on
the line $A_1P$, as in Figure 13.7b, and then determine $t_1$ as the mass at $A_1$
that will balance a mass $t_2 + t_3$ at Q so as to make P the centroid. Again,
as in the one-dimensional case, these coordinates are homogeneous:


$(t_1, t_2, t_3) = (\mu t_1, \mu t_2, \mu t_3) \qquad (\mu \neq 0).$


Joining P to $A_1, A_2, A_3$, we decompose $A_1A_2A_3$ into three triangles having
a common vertex P. The areas of these triangles are proportional to the
barycentric coordinates of P, as in Figure 13.7c. This fact follows at once
from 13.42, since


$\dfrac{t_3}{t_2} = \dfrac{A_2Q}{QA_3} = \dfrac{A_1A_2Q}{A_1QA_3} = \dfrac{PA_2Q}{PQA_3} = \dfrac{A_1A_2Q - PA_2Q}{A_1QA_3 - PQA_3} = \dfrac{PA_1A_2}{PA_3A_1},$


and similarly for $t_1/t_3$, $t_2/t_1$. Positions of P outside the triangle are covered
by means of our convention for the sign of the area of a directed triangle.

Figure 13.7b        Figure 13.7c

The inequality
$t_1 + t_2 + t_3 \neq 0$


enables us to normalize the coordinates so that
13.71    $t_1 + t_2 + t_3 = 1.$


(We merely have to divide each coordinate by the sum of all three.) These
normalized barycentric coordinates are called areal coordinates, because
they are just the areas of the triangles $PA_2A_3$, $PA_3A_1$, $PA_1A_2$, expressed in
terms of the area of the whole triangle $A_1A_2A_3$ as unit of measurement.
Areal coordinates are not homogeneous but “redundant”: the position of a
point is determined by two of the three, and the third is retained for the
sake of symmetry. However, any expression involving them can be made
homogeneous by inserting suitable powers of $t_1 + t_2 + t_3$ in appropriate
places.
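The description of areal coordinates as ratios of signed areas translates directly into a computation. The sketch below is an illustration with hypothetical names (`signed_area`, `areal_coordinates`, `point_from_areal`); the vertices are given by integer affine coordinates so that the results are exact fractions.

```python
from fractions import Fraction

def signed_area(p, q, r):
    """Determinant 13.45 for the directed triangle pqr."""
    (x1, y1), (x2, y2), (x3, y3) = p, q, r
    return x1 * (y2 - y3) - y1 * (x2 - x3) + (x2 * y3 - x3 * y2)

def areal_coordinates(P, A1, A2, A3):
    """Areas of P A2 A3, P A3 A1, P A1 A2 as fractions of the triangle A1 A2 A3."""
    whole = signed_area(A1, A2, A3)
    if whole == 0:
        raise ValueError("degenerate triangle of reference")
    return (Fraction(signed_area(P, A2, A3), whole),
            Fraction(signed_area(P, A3, A1), whole),
            Fraction(signed_area(P, A1, A2), whole))

def point_from_areal(t, A1, A2, A3):
    """Centroid of masses t1, t2, t3 (with sum 1) at A1, A2, A3."""
    t1, t2, t3 = t
    return (t1 * A1[0] + t2 * A2[0] + t3 * A3[0],
            t1 * A1[1] + t2 * A2[1] + t3 * A3[1])

A1, A2, A3 = (0, 0), (4, 0), (0, 4)
inside = areal_coordinates((1, 1), A1, A2, A3)
outside = areal_coordinates((-1, -1), A1, A2, A3)
print(inside, sum(inside))
print(outside, sum(outside))        # negative coordinates for a point outside
print(point_from_areal(inside, A1, A2, A3))
```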


Figure 13.7d


In affine coordinates, as we have seen, a line has a linear equation. In
barycentric coordinates, as we shall soon see, a line has a linear homogeneous
equation. For this purpose we use the segments $A_3A_1$ and $A_3A_2$ as axes
for affine coordinates, as in Figure 13.7d, so that the coordinates of P, $A_1$,
$A_2$, $A_3$, which were formerly


$(t_1, t_2, t_3),\ (1, 0, 0),\ (0, 1, 0),\ (0, 0, 1),$
are now


$(x, y),\ (1, 0),\ (0, 1),\ (0, 0).$


By 13.44, the areas of $PA_2A_3$ and $PA_3A_1$, as fractions of the “unit” triangle
$A_1A_2A_3$, are just


$\begin{vmatrix} x & y \\ 0 & 1 \end{vmatrix} = x \quad\text{and}\quad \begin{vmatrix} 1 & 0 \\ x & y \end{vmatrix} = y.$


By subtraction, the area of $PA_1A_2$ is $1 - x - y$. Hence the areal coordinates
of P are related to the affine coordinates by the very simple formulas
$t_1 = x, \qquad t_2 = y, \qquad t_3 = 1 - x - y.$
The general line, having the affine equation 8.11, has the areal equation
$a t_1 + b t_2 + c = 0.$


Making this homogeneous by the insertion of $t_1 + t_2 + t_3$, we deduce the
barycentric equation


$a t_1 + b t_2 + c(t_1 + t_2 + t_3) = 0$
or $(a + c)t_1 + (b + c)t_2 + c t_3 = 0$


or, in a more symmetrical notation,


13.72    $T_1 t_1 + T_2 t_2 + T_3 t_3 = 0.$


Thus every line has a linear homogeneous equation. In particular, the
lines $A_2A_3$, $A_3A_1$, $A_1A_2$ have the equations


13.73    $t_1 = 0, \qquad t_2 = 0, \qquad t_3 = 0.$
The line joining two given points (r) and (s), meaning
$(r_1, r_2, r_3)$ and $(s_1, s_2, s_3)$,


has the equation


13.74    $\begin{vmatrix} r_1 & r_2 & r_3 \\ s_1 & s_2 & s_3 \\ t_1 & t_2 & t_3 \end{vmatrix} = 0.$
th to ts 


For, this equation is linear in the t's and is satisfied when the t's are replaced
by the r's or the s's. Another way to obtain this result is to ask for the fixed
points (r) and (s) to form with the variable point (t) a “triangle” whose area
is zero. In terms of areal coordinates, with the triangle of reference as unit,
the area of the triangle (r)(s)(t) is, by 13.45 and 13.71,


$\begin{vmatrix} r_1 & r_2 & 1 \\ s_1 & s_2 & 1 \\ t_1 & t_2 & 1 \end{vmatrix} = \begin{vmatrix} r_1 & r_2 & r_1 + r_2 + r_3 \\ s_1 & s_2 & s_1 + s_2 + s_3 \\ t_1 & t_2 & t_1 + t_2 + t_3 \end{vmatrix} = \begin{vmatrix} r_1 & r_2 & r_3 \\ s_1 & s_2 & s_3 \\ t_1 & t_2 & t_3 \end{vmatrix}.$


Hence the area in general barycentric coordinates is this last determinant
divided by
$(r_1 + r_2 + r_3)(s_1 + s_2 + s_3)(t_1 + t_2 + t_3).$
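Both results of this passage, the line 13.74 through two points and the area of a triangle in general barycentric coordinates, reduce to 3 x 3 determinants. The following sketch uses hypothetical helper names (`det3`, `line_through`, `on_line`, `barycentric_area`) and integer coordinates; expanding 13.74 along its last row gives the coefficients $T_1, T_2, T_3$ of 13.72 as the 2 x 2 minors computed in `line_through`.

```python
from fractions import Fraction

def det3(rows):
    (a, b, c), (d, e, f), (g, h, i) = rows
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def line_through(r, s):
    """Coefficients (T1, T2, T3) of the line 13.72 joining the points (r) and (s)."""
    return (r[1] * s[2] - r[2] * s[1],
            r[2] * s[0] - r[0] * s[2],
            r[0] * s[1] - r[1] * s[0])

def on_line(T, t):
    return sum(Ti * ti for Ti, ti in zip(T, t)) == 0

def barycentric_area(r, s, t):
    """Area of the triangle (r)(s)(t), with the triangle of reference as unit."""
    return Fraction(det3([r, s, t]), sum(r) * sum(s) * sum(t))

median = line_through((1, 0, 0), (0, 1, 1))       # A1 joined to the midpoint of A2A3
print(median, on_line(median, (1, 1, 1)))         # the centroid lies on the median
print(line_through((0, 1, 0), (0, 0, 1)))         # the side A2A3, namely t1 = 0
print(barycentric_area((0, 1, 1), (1, 0, 1), (1, 1, 0)))
```

The last line gives the area of the triangle joining the midpoints of the sides, one quarter of the triangle of reference.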


We are now ready to prove Routh's theorem 13.55 in its full generality.
Identifying ABC with $A_1A_2A_3$, so that the points L, M, N are


$(0, 1, \lambda),\ (\mu, 0, 1),\ (1, \nu, 0),$
we can express the lines AL, BM, CN as


$\lambda t_2 = t_3, \qquad \mu t_3 = t_1, \qquad \nu t_1 = t_2.$
They intersect in pairs in the three points


$(\mu, \mu\nu, 1),\ (1, \nu, \nu\lambda),\ (\lambda\mu, 1, \lambda),$


forming a triangle whose area, in terms of that of the triangle of reference,
is the result of dividing the determinant


$\begin{vmatrix} \mu & \mu\nu & 1 \\ 1 & \nu & \nu\lambda \\ \lambda\mu & 1 & \lambda \end{vmatrix} = (\lambda\mu\nu - 1)^2$


by $(\mu + \mu\nu + 1)(1 + \nu + \nu\lambda)(\lambda\mu + 1 + \lambda)$, in agreement with the
statement of 13.55.
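As a cross-check on this proof, the ratio can be computed in three ways: from the formula of 13.55, from the barycentric determinant, and by intersecting the cevians in ordinary affine coordinates. The sketch below is only an illustration; all function names are hypothetical, and the triangle ABC is taken to be (0, 0)(1, 0)(0, 1).

```python
from fractions import Fraction

def routh_formula(lam, mu, nu):
    num = (lam * mu * nu - 1) ** 2
    den = (lam * mu + lam + 1) * (mu * nu + mu + 1) * (nu * lam + nu + 1)
    return Fraction(num) / den

def det3(rows):
    (a, b, c), (d, e, f), (g, h, i) = rows
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def routh_by_determinant(lam, mu, nu):
    """Determinant of the three intersection points, divided by their coordinate sums."""
    pts = [(mu, mu * nu, 1), (1, nu, nu * lam), (lam * mu, 1, lam)]
    return Fraction(det3(pts)) / (sum(pts[0]) * sum(pts[1]) * sum(pts[2]))

def intersect(p1, p2, p3, p4):
    """Common point of the lines p1p2 and p3p4 (assumed not parallel)."""
    rx, ry = p2[0] - p1[0], p2[1] - p1[1]
    sx, sy = p4[0] - p3[0], p4[1] - p3[1]
    t = ((p3[0] - p1[0]) * sy - (p3[1] - p1[1]) * sx) / (rx * sy - ry * sx)
    return (p1[0] + t * rx, p1[1] + t * ry)

def cevian_triangle_ratio(lam, mu, nu):
    A = (Fraction(0), Fraction(0))
    B = (Fraction(1), Fraction(0))
    C = (Fraction(0), Fraction(1))
    divide = lambda P, Q, k: ((P[0] + k * Q[0]) / (1 + k), (P[1] + k * Q[1]) / (1 + k))
    L, M, N = divide(B, C, lam), divide(C, A, mu), divide(A, B, nu)
    X, Y, Z = intersect(B, M, C, N), intersect(C, N, A, L), intersect(A, L, B, M)
    rows = [(X[0], X[1], 1), (Y[0], Y[1], 1), (Z[0], Z[1], 1)]
    return det3(rows)          # the triangle ABC itself has determinant 1

for ratios in [(2, 2, 2), (1, 2, 3), (1, 1, 1)]:
    print(ratios, routh_formula(*ratios), routh_by_determinant(*ratios),
          cevian_triangle_ratio(*ratios))
```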
As an important special case we have 


CEVA'S THEOREM. Let the sides of a triangle ABC be divided at L, M, N
in the respective ratios $\lambda : 1$, $\mu : 1$, $\nu : 1$. Then the three lines AL, BM, CN
are concurrent if and only if $\lambda\mu\nu = 1$.


The general line 13.72 meets the sides 13.73 of the triangle of reference
in the points


$(0, T_3, -T_2),\ (-T_3, 0, T_1),\ (T_2, -T_1, 0),$


which divide them in the ratios


$-\dfrac{T_2}{T_3}, \qquad -\dfrac{T_3}{T_1}, \qquad -\dfrac{T_1}{T_2},$


whose product is $-1$. Conversely, any three numbers whose product is $-1$
can be expressed in this way for suitable values of $T_1, T_2, T_3$. Hence


MENELAUS'S THEOREM. Let the sides of a triangle be divided at L, M, N
in the respective ratios $\lambda : 1$, $\mu : 1$, $\nu : 1$. Then the three points L, M, N are
collinear if and only if $\lambda\mu\nu = -1$.
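Both criteria can be read off from determinants: the coefficient rows of the three cevians $\lambda t_2 = t_3$, $\mu t_3 = t_1$, $\nu t_1 = t_2$ give $\lambda\mu\nu - 1$, and the coordinate rows of L, M, N give $\lambda\mu\nu + 1$. A minimal sketch, with hypothetical function names, follows; note that a vanishing coefficient determinant for three lines also covers the case where they are all parallel.

```python
from fractions import Fraction

def det3(rows):
    (a, b, c), (d, e, f), (g, h, i) = rows
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def ceva_determinant(lam, mu, nu):
    """Coefficient rows of lam*t2 - t3 = 0, mu*t3 - t1 = 0, nu*t1 - t2 = 0."""
    return det3([(0, lam, -1), (-1, 0, mu), (nu, -1, 0)])

def menelaus_determinant(lam, mu, nu):
    """Coordinate rows of L = (0, 1, lam), M = (mu, 0, 1), N = (1, nu, 0)."""
    return det3([(0, 1, lam), (mu, 0, 1), (1, nu, 0)])

print(ceva_determinant(2, 3, Fraction(1, 6)))        # product of ratios is 1
print(menelaus_determinant(2, 3, Fraction(-1, 6)))   # product of ratios is -1
print(ceva_determinant(2, 2, 2), menelaus_determinant(1, 1, 1))
```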


Figure 13.7e


The coefficients $T_1, T_2, T_3$ in the equation 13.72 for a line are sometimes
called the tangential coordinates of the line. These homogeneous “coordinates”
have a simple geometric interpretation [Salmon 1, p. 11]: they may
be regarded as the distances from $A_1, A_2, A_3$ to the line, measured in any
direction (the same for all). To prove this, let $A_1F_1$, $A_2F_2$, $A_3F_3$ be these
distances, as in Figure 13.7e. Since


$\dfrac{A_1N}{NA_2} = -\dfrac{T_1}{T_2},$

the homothetic triangles $NA_1F_1$ and $NA_2F_2$ yield

$\dfrac{A_1F_1}{A_2F_2} = \dfrac{A_1N}{A_2N} = \dfrac{T_1}{T_2}.$


Hence $\dfrac{A_1F_1}{T_1} = \dfrac{A_2F_2}{T_2},$

and similarly each of these expressions is equal to $\dfrac{A_3F_3}{T_3}$.

Möbius's invention of homogeneous coordinates was one of the most
far-reaching ideas in the history of mathematics: comparable to Leibniz's
invention of differentials, which enabled him to express the equation


$\dfrac{d}{dx} f(x) = f'(x)$


in the homogeneous form


$d f(x) = f'(x)\,dx$


(for instance, $d \sin x = \cos x\,dx$).


EXERCISES 
1. Sketch the seven regions into which the lines $A_2A_3$, $A_3A_1$, $A_1A_2$ decompose the
plane, marking each according to the signs of the three areal coordinates.

2. Verify that 13.45 yields $1 - x - y$ as the area of the triangle $PA_1A_2$ in Figure
13.7d.

3. In areal coordinates, the midpoint of $(s_1, s_2, s_3)(t_1, t_2, t_3)$ is

$\left(\dfrac{s_1 + t_1}{2},\ \dfrac{s_2 + t_2}{2},\ \dfrac{s_3 + t_3}{2}\right).$

4. The centroid of masses $\sigma$ and $\tau$ at points whose areal coordinates are $(s_1, s_2, s_3)$
and $(t_1, t_2, t_3)$ is the point whose barycentric coordinates are

$(\sigma s_1 + \tau t_1,\ \sigma s_2 + \tau t_2,\ \sigma s_3 + \tau t_3).$

5. In barycentric coordinates, any point on the line (s)(t) may be expressed in the
form
$(\sigma s_1 + \tau t_1,\ \sigma s_2 + \tau t_2,\ \sigma s_3 + \tau t_3).$

6. Apply barycentric coordinates to Ex. 6 at the end of § 13.5. What becomes of
this result when L, M, N are collinear?

7. In what way do the signs of $T_1, T_2, T_3$ depend on the position of the line 13.72
in relation to the triangle of reference? When $T_2$ and $T_3$ are positive, describe the cases
$T_2 < T_3$, $T_2 = T_3$, $T_2 > T_3$.


222 AFFINE GEOMETRY 


13.8 AFFINE SPACE 


Give me something to construct and I shall become God for the time
being, pushing aside all obstacles, winning all the hard knowledge I
need for the construction . . . advancing Godlike to my goal!


J.L. Synge [2, p. 162] 


Affine geometry can be extended from two dimensions to three by using 
Axioms 12.42 and 12.43 instead of 12.41. The total number of axioms is 
not really increased, as 13.12 now becomes a provable theorem [Forder 1, 
pp. 155-157]. A line and a plane, or two planes, are said to be parallel if 
they have no common point (or if the line lies in the plane, or if the two 
planes coincide). Thus any plane that meets two parallel planes meets them 
in parallel lines; if two planes are parallel, any line in either plane is paral- 
lel to the other plane; if two lines are parallel, any plane through either line 
is parallel to the other line. 

The existence of parallel planes is ensured by the following theorem (cf.
Axiom 13.11):


13.81 For any point A and any plane $\gamma$, not through A, there is just one
plane through A parallel to $\gamma$.


Proof. Let q and r be two intersecting lines in $\gamma$. Let $q'$ and $r'$ be the
respectively parallel lines through A. Then the plane $q'r'$ is parallel to $\gamma$. For
otherwise, by 12.431, the two planes would meet in a line l. Since $q'$ and $r'$
are parallel to $\gamma$, they cannot meet l. Thus $q'$ and $r'$ are two parallels to l
through A, contradicting 13.11. This proves that $q'r'$ is parallel to $\gamma$.
Moreover, $q'r'$ is the only plane through A parallel to $\gamma$. For, two such would meet
in a line $s'$ through A, and we could obtain a contradiction by considering
their section by the plane As, where s is a line in $\gamma$ not parallel to $s'$.


Figure 13.8a 


Parallelism for lines is transitive in space as well as in a plane: 


13.82 If p and q are both parallel to r, they are parallel to each other.

Proof [Forder 1, p. 140]. When all three lines are in one plane, this follows
at once from 13.11, so let us assume that they are not. For any point
Q on q, the planes Qp and Qr meet in a line, say $q'$ (Figure 13.8a). Any
common point of $q'$ and r would lie in both the planes Qp, pr, and therefore
on their common line p; this is impossible, since p is parallel to r. Hence
$q'$ is parallel to r. But the only line through Q parallel to r is q. Hence q
coincides with $q'$, and is coplanar with p. Any common point of p and q
would lie also on r. Hence p and q are parallel.


The transitivity of parallelism provides an alternative proof for 13.81. To
establish the impossibility of a point O lying on both planes $\gamma$ and $q'r'$, we
imagine two lines through O, parallel to q (and $q'$), r (and $r'$). The planes
$\gamma$ and $q'r'$, each containing both these lines, would coincide, contradicting
our assumption that A does not lie in $\gamma$.

The three face planes OBC, OCA, OAB of a tetrahedron OABC form with
the respectively parallel planes through A, B, C a parallelepiped whose faces
are six parallelograms, as in Figure 13.8b [Forder 1, p. 155].


Figure 13.8b        Figure 13.8c


It is now easy to build up a three-dimensional theory of dilatations,
translations, and vectors. Three vectors d, e, f are said to be dependent if they
are coplanar, in which case each is expressible as a linear combination of 
the other two. Three vectors e, f, g are said to be independent if the only 
solution of the vector equation 


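$x\mathbf{e} + y\mathbf{f} + z\mathbf{g} = \mathbf{0}$
is x = y = z = 0. Three such vectors provide a basis for a system of
three-dimensional affine coordinates. In fact, if


$\mathbf{e} = \overrightarrow{OA}, \qquad \mathbf{f} = \overrightarrow{OB}, \qquad \mathbf{g} = \overrightarrow{OC},$


as in Figure 13.8c, the general vector $\overrightarrow{OP}$ may be exhibited as a diagonal of
the parallelepiped formed by drawing through P three planes parallel to
OBC, OCA, OAB. Then


$\overrightarrow{OP} = x\mathbf{e} + y\mathbf{f} + z\mathbf{g},$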



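where the terms of this sum are vectors along three edges of the parallelepiped.

In space, as in a plane, the centroid P of masses $t_i$ at points $A_i$ is determined
by a vector $\overrightarrow{OP}$ such that

$\Sigma t_i\,\overrightarrow{OP} = \Sigma t_i\,\overrightarrow{OA_i} \qquad (\Sigma t_i \neq 0).$

If $\overrightarrow{OA_i} = x_i\mathbf{e} + y_i\mathbf{f} + z_i\mathbf{g}$, we deduce


$\Sigma t_i\,\overrightarrow{OP} = \Sigma t_i x_i\,\mathbf{e} + \Sigma t_i y_i\,\mathbf{f} + \Sigma t_i z_i\,\mathbf{g}.$


Hence, in terms of affine coordinates,


13.83 The centroid of k masses $t_i$ ($\Sigma t_i \neq 0$) at points $(x_i, y_i, z_i)$
$(i = 1, \ldots, k)$ is

$\left(\dfrac{\Sigma t_i x_i}{\Sigma t_i},\ \dfrac{\Sigma t_i y_i}{\Sigma t_i},\ \dfrac{\Sigma t_i z_i}{\Sigma t_i}\right).$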

In particular, if $t_1 + t_2 + t_3 = 1$, the centroid of three masses $t_1, t_2, t_3$
at the points
(1, 0, 0), (0, 1, 0), (0, 0, 1)
is $(t_1, t_2, t_3)$. Hence


13.84 The affine coordinates of any point in the plane x + y + z = 1
are the same as its areal coordinates referred to the triangle cut out from this
plane by the coordinate planes x = 0, y = 0, z = 0.


It follows that there is a line

$\dfrac{x}{t_1} = \dfrac{y}{t_2} = \dfrac{z}{t_3}$

through the origin (in affine space) for each point with barycentric coordinates
$(t_1, t_2, t_3)$. On the other hand, lines lying in the plane x + y + z = 0
yield no corresponding points in the parallel plane x + y + z = 1, unless
we agree to extend the affine plane by postulating a line at infinity


$t_1 + t_2 + t_3 = 0$


so as to form the projective plane. This possibility has already been men- 
tioned in § 6.9; we shall explore it more systematically in Chapter 14. 


EXERCISES 


1. If a line a is parallel to a plane α, and a plane through a meets α in b, then a
and b are parallel lines. If another plane through a meets α in c, then b and c are
parallel lines.

2. If α, β, γ are planes intersecting in lines β · γ = a, γ · α = b, α · β = c, and
a is parallel to b, then a, b, c are all parallel.

3. All the lines through A parallel to α are in a plane parallel to α [Forder 1, p. 155].

4. Each of the six edges of a tetrahedron lies on a plane joining this edge to the
midpoint of the opposite edge. The six planes so constructed all pass through one 
point: the centroid of equal masses at the four vertices. 

5. Develop the theory of three-dimensional barycentric coordinates referred to a 
tetrahedron A1A2A3A4.


13.9 THREE-DIMENSIONAL LATTICES 


The small parallelepiped built upon the three translations selected as
unit translations ... is known as the unit cell ... The entire crystal
structure is generated through the periodic repetition, by the three unit
translations, of the matter contained within the volume of the unit cell.


M. J. Buerger (1903 - ) 
[Buerger 1, p. 5] 


The theory of volume in affine space is more difficult than that of area in 
the affine plane, because of the complication introduced by M. Dehn’s ob- 
servation that two polyhedra of equal volume are not necessarily derivable 
from each other by dissection and rearrangement. A valid treatment, sug- 
gested by Mrs. Sally Ruth Struik, may be described very briefly as follows. 
It is found that any two tetrahedra are related by a unique affinity ABCD →
A′B′C′D′, which transforms the whole space into itself in such a way as to
preserve collinearity. In particular, a tetrahedron ABCC′ is transformed
into ABC′C by the affine reflection


AB(CC′),


which interchanges C and C’ while leaving invariant every point in the plane 
that joins AB to the midpoint of CC’. Two tetrahedra are said to have 
the same volume if one can be transformed into the other by an equiaffinity: 
the product of an even number of affine reflections. Such a comparison is 
easily extended from tetrahedra to parallelepipeds, since a parallelepiped 
can be dissected into six tetrahedra all having the same volume. 

In three dimensions, as in two, a lattice may be regarded as the set of 
points whose affine coordinates are integers. However, as it is independent 
of the chosen coordinate system, it is more symmetrically described as a 
discrete set of points whose set of position vectors is closed under subtrac- 
tion, that is, along with any two of the vectors the set includes also their 
difference. Subtracting any one of the vectors from itself, we obtain the 
zero vector 

c − c = 0


and hence also 0 − b = −b, a − (−b) = a + b, a + a = 2a, and so
on: the set of vectors, containing the difference of any two, also contains the
sum of any two, and all the integral multiples of any one. The lattice is one-,
two-, or three-dimensional according to the number of independent vectors.
In the three-dimensional case, a set of three independent vectors e, f, g is 
called a basis for the lattice if all the vectors are expressible in the form 


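13.91 xe + yf + zg,


where x, y, z are integers. If three of these vectors, say r1, r2, r3, form
another basis for the same lattice, there must exist 18 integers

a_α, b_α, c_α, A_α, B_α, C_α (α = 1, 2, 3)

such that

$$\mathbf{r}_\alpha = a_\alpha \mathbf{e} + b_\alpha \mathbf{f} + c_\alpha \mathbf{g}, \qquad \mathbf{e} = \sum_\beta A_\beta \mathbf{r}_\beta, \quad \mathbf{f} = \sum_\beta B_\beta \mathbf{r}_\beta, \quad \mathbf{g} = \sum_\beta C_\beta \mathbf{r}_\beta,$$

and therefore

$$\mathbf{r}_\alpha = a_\alpha \sum_\beta A_\beta \mathbf{r}_\beta + b_\alpha \sum_\beta B_\beta \mathbf{r}_\beta + c_\alpha \sum_\beta C_\beta \mathbf{r}_\beta,$$

whence

$$a_\alpha A_\beta + b_\alpha B_\beta + c_\alpha C_\beta = \begin{cases} 1 & \text{if } \alpha = \beta, \\ 0 & \text{if } \alpha \neq \beta. \end{cases}$$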

Since the product of two determinants is obtained by combining the rows 
of one with the columns of the other, we have 


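$$\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix} \begin{vmatrix} A_1 & A_2 & A_3 \\ B_1 & B_2 & B_3 \\ C_1 & C_2 & C_3 \end{vmatrix} = \begin{vmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{vmatrix} = 1.$$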


Since the two determinants on the left are integers whose product is 1, each
must be ±1. Conversely, if a_α, b_α, c_α are given so that their determinant is
±1, we can derive A_α, B_α, C_α by “inverting the matrix,” and the given basis
e, f, g yields the equally effective basis r_α. Hence


A necessary and sufficient condition for two triads of independent vectors

e, f, g and a_α e + b_α f + c_α g (α = 1, 2, 3)

to be alternative bases for the same lattice is

$$\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix} = \pm 1 \tag{13.92}$$

[cf. Hardy and Wright 1, p. 28; Neville 1, p. 5].
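
Criterion 13.92 is easy to test mechanically. The sketch below, with the hypothetical helper names `det3` and `is_alternative_basis`, takes the nine integers a_α, b_α, c_α as three rows and reports whether the new triad is an alternative basis for the same lattice. It assumes the coefficients are ordinary Python integers.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as three rows (a, b, c)."""
    (a1, b1, c1), (a2, b2, c2), (a3, b3, c3) = m
    return (a1 * (b2 * c3 - b3 * c2)
            - b1 * (a2 * c3 - a3 * c2)
            + c1 * (a2 * b3 - a3 * b2))


def is_alternative_basis(rows):
    """True if the vectors a*e + b*f + c*g, one per row, satisfy 13.92.

    The rows must be integral and their determinant must be +1 or -1.
    """
    integral = all(isinstance(v, int) for row in rows for v in row)
    return integral and abs(det3(rows)) == 1
```

For instance, the triad e + f, f + g, g has determinant 1 and generates the same lattice, whereas 2e, f, g has determinant 2 and generates only a sublattice.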


In other words, a lattice is derived from any one of its points by applying a 
discrete group of translations: one-, two-, or three-dimensional according as 
the translations are collinear, coplanar but not collinear, or not coplanar. 
In the one-dimensional case the generating translation is unique (except that
it may be reversed), but in the other cases the two or three generators, that is,
the basic vectors, may be chosen in infinitely many ways. When they have
been chosen, we can use them to set up a system of affine coordinates so that,
in the three-dimensional case, the vector 13.91 goes from the origin (0, 0, 0)
to the point (x, y, z), and the lattice consists of the points whose coordinates
are integers. The eight points


(0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1), (0, 1, 1), (1, 0, 1), (1, 1, 0), (1, 1, 1),
derived from the eight vectors
0, e, f, g, f + g, g + e, e + f, e + f + g,


evidently form a parallelepiped, which is a unit cell of the lattice. By an argu- 
ment analogous to that used for a two-dimensional lattice in § 4.1, any two 
unit cells for the same lattice have the same volume. 


Any line joining two of the lattice points contains infinitely many of them, 
forming a one-dimensional sublattice of the three-dimensional lattice. In 
fact, the line joining (0, 0, 0) and (x, y, z) contains also (nx, ny, nz) for every 
integer n. If x, y, z have the greatest common divisor d, the lattice point 


(x/d, y/d, z/d) 


lies on this same line, and the corresponding translation generates the group 
of the one-dimensional lattice. The lattice point (x, y, z) is visible if and only 
if the three integers x, y, z have no common divisor greater than 1. 
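
The visibility criterion just stated amounts to a greatest-common-divisor computation. A minimal Python sketch follows; the names `is_visible` and `primitive_step` are illustrative only.

```python
from math import gcd


def is_visible(x, y, z):
    """True if the lattice point (x, y, z) is visible from the origin,
    that is, if x, y, z have no common divisor greater than 1."""
    return gcd(gcd(abs(x), abs(y)), abs(z)) == 1


def primitive_step(x, y, z):
    """The lattice point (x/d, y/d, z/d) on the line from the origin to
    (x, y, z), where d is the greatest common divisor of x, y, z."""
    d = gcd(gcd(abs(x), abs(y)), abs(z))
    if d == 0:
        raise ValueError("the origin does not determine a line")
    return (x // d, y // d, z // d)
```

The translation returned by `primitive_step` is the one that generates the one-dimensional sublattice on the line in question.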

Any triangle of lattice points determines a plane containing a two-dimen- 
sional sublattice. For, if vectors 


r1 = x1 e + y1 f + z1 g and r2 = x2 e + y2 f + z2 g


have integral components, so also does t1 r1 + t2 r2 for any integers t1 and t2.
The parallel plane through any other lattice point will contain a congruent 
sublattice. Thus we may regard all the lattice points as being distributed 
among an infinite sequence of parallel planes, called rational planes [Buer- 
ger 1, p. 7]. 

Any such plane, being the join of three points whose coordinates are in- 
tegers, has an equation of the form 


13.93 Xx + Yy + Zz=N, 


where the coefficients X, Y, Z, N are integers, so that the intercepts on the 
coordinate axes have the rational values N/X, N/Y, N/Z. (This is the rea- 
son for the name “rational” planes.) We may assume that the greatest 
common divisor of X, Y, Z is 1; for, any common factor of X, Y, Z would 
be a factor of N too, and then we could divide both sides of the equation 
by this number, obtaining a simpler and equally effective equation for the 
same plane. 

Conversely, any such equation (in which the greatest common divisor of
X, Y, Z is 1) represents a plane containing a two-dimensional sublattice.
This is obvious when X = 1, since then we can assign arbitrary integral
values to y, z, and solve 13.93 for x. When X, Y, Z are all greater than 1, we
consider the set of numbers


xX + yY + zZ,


where x, y, z are variable integers while X, Y, Z remain constant. This set
(like the set of lattice vectors) is an ideal: it contains the difference of any
two of its members and (therefore) all the multiples of any one. Let d denote
its smallest positive member, and N any other member. Then N is a
multiple of d: for otherwise we could divide N by d and obtain a remainder
N − qd, which would be a member smaller than d. Thus every member of
the set is a multiple of d. But X, Y, Z are members. Therefore d, being a
common divisor, must be equal to 1, and the set simply consists of all the
integers. In other words, the equation 13.93 has one integral solution (and
therefore infinitely many) [cf. Uspensky and Heaslet 1, p. 54].
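
The argument above proves that a solution exists without constructing one. A constructive alternative, not the method of the text, is the extended Euclidean algorithm applied twice. The following Python sketch (function names are hypothetical) returns one lattice point on the plane 13.93, or `None` when the greatest common divisor of X, Y, Z does not divide N.

```python
def extended_gcd(a, b):
    """Return (g, s, t) with g = gcd(a, b) >= 0 and s*a + t*b == g."""
    old_r, r = a, b
    old_s, s = 1, 0
    old_t, t = 0, 1
    while r != 0:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_s, s = s, old_s - q * s
        old_t, t = t, old_t - q * t
    if old_r < 0:
        # Normalize so that the returned divisor is nonnegative.
        old_r, old_s, old_t = -old_r, -old_s, -old_t
    return old_r, old_s, old_t


def lattice_point_on_plane(X, Y, Z, N):
    """One integral solution (x, y, z) of X*x + Y*y + Z*z == N, or None."""
    g, s, t = extended_gcd(X, Y)   # s*X + t*Y == g
    d, u, v = extended_gcd(g, Z)   # u*g + v*Z == d == gcd(X, Y, Z)
    if d == 0 or N % d != 0:
        return None
    k = N // d
    return (k * u * s, k * u * t, k * v)
```

Applied to X, Y, Z = 6, 10, 15 and N = 1, whose coefficients are coprime as a triad though not in pairs, the function returns a lattice point of that first rational plane.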

For each triad of integers X, Y, Z, coprime in the above sense (but not 
necessarily coprime in pairs), we have a sequence of parallel planes 13.93, 
evenly spaced, one plane for each integer N. Since every lattice point lies in 
one of the planes, the infinite region between any two consecutive planes is 
completely empty. One of the planes, namely that for which N = 0, passes 
through the origin. The nearest others, given by N = ±1, are appropriately
called first rational planes [Buerger 1, p. 9]. We shall have occasion to con- 
sider them again in § 18.3. 


EXERCISES 


1. How can a parallelepiped be dissected into six tetrahedra all having the same 
volume? 


2. Identify the transformation (x, y, z) → (x, y, −z) with the affine reflection that
leaves invariant the plane z = 0 while interchanging the points (0, 0, ±1).


3. A lattice is transformed into itself by the central inversion that interchanges two 
of its points. 

4. Every lattice point in a first rational plane is visible. 

5. Is every rational plane through a visible point a first rational plane? 

6. Find a triangle of lattice points in the first rational plane

6x + 10y + 15z = 1.
7. Obtain a formula for all the lattice points in this plane.
8. The origin is the only lattice point in the plane


x + √2 y + √3 z = 0.


14 


Projective geometry 


In affine geometry, as we have seen, parallelism plays a leading role. 
In projective geometry, on the other hand, there is no parallelism: every pair 
of coplanar lines is a pair of intersecting lines. The conflict with 12.61 is
explained by the fact that the projective plane is not an “ordered” plane. The
set of points on a line, like the set of lines through a point, is closed: given 
three, we cannot pick out one as lying “between” the other two. At first 
sight we might expect a geometry having no circles, no distances, no angles, 
no intermediacy, and no parallelism, to be somewhat meagre. But, in fact, 
a very beautiful and intricate collection of propositions emerges: proposi- 
tions of which Euclid never dreamed, because his interest in measurement 
led him in a different direction. A few of these nonmetrical propositions 
were discovered by Pappus of Alexandria in the fourth century A.D. Others
are associated with the names of two Frenchmen: the architect Girard
Desargues (1591-1661) and the philosopher Blaise Pascal (1623-1662).
Meanwhile, the related subject of perspective [Yaglom 2, p. 31] had been studied
by artists such as Leonardo da Vinci (1452-1519) and Albrecht Dürer
(1471-1528).

Kepler’s invention of points at infinity made it possible to regard the 
projective plane as the affine plane plus the line at infinity. A converse 
relationship was suggested by Poncelet’s Traité des propriétés projectives des 
figures (1822) and von Staudt’s Geometrie der Lage (1847), in which projec- 
tive geometry appeared as an independent science, making it possible to 
regard the affine plane as the projective plane minus an arbitrary line o, and 
then to regard the Euclidean plane as the affine plane with a special rule for 
associating pairs of points on o (in “perpendicular directions”) [Coxeter 2, 
pp. 115, 138]. This standpoint became still clearer in 1899, when Mario Pieri 
placed the subject on an axiomatic foundation. Other systems of axioms, 
slightly different from Pieri’s, have been proposed by subsequent authors. 
The particular system that we shall give in § 14.1 was suggested by
Bachmann [1, pp. 76-77]. To test the consistency of a system of axioms, we
apply it to a “model,” in which the primitive concepts are represented by
familiar concepts whose properties we are prepared to accept [Coxeter 2, pp.
186-187]. In the present case a convenient model for the projective plane
is provided by the affine plane plus the line at infinity (§ 6.9). We shall
extend the barycentric coordinates of § 13.7 to general projective coordinates,
so as to eliminate the special role of the line at infinity. The result may be
regarded as a purely algebraic model in which a point is an ordered triad of
numbers (x1, x2, x3), not all zero, with the rule that (μx1, μx2, μx3) is the same
point for any μ ≠ 0, and a line is a homogeneous linear equation. One
advantage of this model is that the numbers x_α and μ are not necessarily real.
The chosen axioms are sufficiently general to allow the coordinates to be- 
long to any field: instead of real numbers we may use rational numbers, com- 
plex numbers, or even a finite field such as the residue classes modulo a prime 
number. Accordingly we speak of the real projective plane, the rational pro- 
jective plane, the complex projective plane, or a finite projective plane. 


14.1. AXIOMS FOR THE GENERAL PROJECTIVE PLANE 


The more systematic course in the present introductory memoir... . 
would have been to ignore altogether the notions of distance and 
metrical geometry. . . . Metrical geometry is a part of descriptive 
geometry, and descriptive geometry is all geometry. 


Arthur Cayley* (1821-1895)


The projective plane has already been mentioned in §6.9. As primitive 
concepts we take point, line, and the relation of incidence. If a point and a 
line are incident, we say that the point lies on the line and the line passes 
through the point. The related words join, meet (or “intersect”), concurrent 
and collinear have their usual meanings. Three non-collinear points are the 
vertices of a triangle whose sides are complete lines. (“Segments” are not
defined.) A complete quadrangle, its four vertices, its six sides, and its three
diagonal points, are defined as in § 1.7. A hexagon A1B2C1A2B1C2 has six
vertices A1, B2, ..., C2 and six sides


A1B2, B2C1, C1A2, A2B1, B1C2, C2A1.


Opposite sides are defined in the obvious manner; for example, A2B1 is
opposite to A1B2. After these preliminary definitions, we are ready for the five
axioms. 


AXIOM 14.11 Any two distinct points are incident with just one line.
NOTATION. The line joining points A and B is denoted by AB. 


* Collected Mathematical Papers, 2 (Cambridge, 1889), p. 592. Cayley, in 1859, used the word 
“descriptive” where today we would say “projective.” His idea of the supremacy of projective ge- 
ometry must now be regarded as a slight exaggeration. It is true that projective geometry includes 
the affine, Euclidean and non-Euclidean geometries; but it does not include the general
Riemannian geometry, nor topology.

AXIOM 14.12 Any two lines are incident with at least one point.
THEOREM 14.121 Any two distinct lines are incident with just one point.


NOTATION. The point of intersection of lines a and b is denoted by a · b;
that of AB and CD by AB · CD. The line joining a · b and c · d is denoted by
(a · b)(c · d).

AXIOM 14.13 There exist four points of which no three are collinear. 


AXIOM 14.14 (Fano’s axiom) The three diagonal points of a complete 
quadrangle are never collinear. 


AXIOM 14.15 (Pappus’s theorem) If the six vertices of a hexagon lie
alternately on two lines, the three points of intersection of pairs of opposite sides
are collinear. 


One of the most elegant properties of projective geometry is the principle 
of duality, which asserts (in a projective plane) that every definition remains
significant, and every theorem remains true, when we consistently
interchange the words point and line (and consequently interchange lie on and
pass through, join and intersection, collinear and concurrent, etc.). To
establish this principle it will suffice to verify that the axioms imply their own
duals. Then, given a theorem and its proof, we can immediately assert the 
dual theorem; for a proof of the latter could be written down mechanically 
by dualizing every step in the proof of the original theorem. 

The dual of Axiom 14.11 is Theorem 14.121, which the reader will have 
no difficulty in proving (with the help of 14.12). The dual of Axiom 14.12 
is one-half of 14.11. The dual of Axiom 14.13 asserts the existence of a 
complete quadrilateral, which is a set of four lines (called sides) intersecting 
in pairs in six distinct points (called vertices). Two vertices are said to be 
opposite if they are not joined by a side. The three joins of pairs of oppo- 
site vertices are called diagonals. If PQRS is a quadrangle with sides 


p = PQ, q = PS, r = RS, s = QR, w = PR, u = QS,
as in Figure 14.1a, then pqrs is a quadrilateral with vertices


P = p · q, Q = p · s, R = r · s, S = q · r, W = p · r, U = q · s.


Figure 14.1a

Axiom 14.14 tells us that the three diagonal points
U = q · s, V = w · u, W = p · r


are not collinear. Its dual asserts that the three diagonals of a complete 
quadrilateral are never concurrent. If this is false, there must exist a par- 
ticular quadrilateral whose diagonals are concurrent. Let it be pqrs, with 
diagonals 


u = QS, v = WU, w = PR.


Since these are concurrent, the point w · u = V must lie on v, contradicting
the statement that U, V, W are not collinear.


Figure 14.1b 


Axiom 14.15 involves nine points and nine lines, which can be drawn in 
many ways (apparently different though projectively equivalent), such as the 
two shown in Figure 14.1b. A1B2C1A2B1C2 is a hexagon whose vertices lie
alternately on the two lines A1B1C1, A2B2C2. The points of intersection of
pairs of opposite sides are


A3 = B1C2 · B2C1, B3 = C1A2 · C2A1, C3 = A1B2 · A2B1.


The axiom asserts that these three points are collinear. Our notation has
been devised in such a way that the three points A_i, B_j, C_k are collinear
whenever

i + j + k ≡ 0 (mod 3).*


Another way to express the same result is to arrange the 9 points in the form
of a matrix


         A1  B1  C1
14.151   A2  B2  C2
         A3  B3  C3


* Coxeter, Self-dual configurations and regular graphs, Bulletin of the American Mathematical
Society, 56 (1950), p. 432.
If this were a determinant that we wished to evaluate, we would proceed to
multiply the elements in triads. These six “diagonal” triads, as well as the 
first two rows of the matrix, indicate triads of collinear points. The axiom 
asserts that the points in the bottom row are likewise collinear. Its inherent 
self-duality is seen from an analogous matrix of lines 


a1  b1  c1
a2  b2  c2
a3  b3  c3


These lines can be picked out in many ways, one of which is


a1 = A3B1C2, b1 = A1B3C2, c1 = A2B2C2,
a2 = A2B3C1, b2 = A3B2C1, c2 = A1B1C1,
a3 = A1B2C3, b3 = A2B1C3, c3 = A3B3C3.


This completes our proof of the principle of duality. 


EXERCISES 


1. Every line is incident with at least three distinct points. (This statement, and the 
existence of a nonincident point and line, are sometimes used as axioms instead of 14.13 
[Robinson 1, p. 10; Coxeter 2, p. 13].) 

2. A set of m points and n lines is called a configuration (m_c, n_d) if c of the n lines pass
through each of the points while d of the m points lie on each of the lines. The four
numbers are not independent but satisfy cm = dn. The dual of (m_c, n_d) is (n_d, m_c).

In the case of a self-dual configuration, we have m = n, c = d, and the symbol (n_d, n_d)
is conveniently abbreviated to n_d. Simple instances are the triangle 3_2, the complete
quadrangle (4_3, 6_2) and the complete quadrilateral (6_2, 4_3). Axiom 14.14 asserts the
nonexistence of the Fano configuration* 7_3. The points and lines that occur in Axiom
14.15 (Figure 14.1b) form the Pappus configuration 9_3, which may be regarded (in how
many ways?) as a cycle of three triangles such as


A1B1C2, A2B2C3, A3B3C1,


each inscribed in the next (cf. Figure 1.8a, where UVW is inscribed in ABC). The
self-duality is evident.
By a suitable change of notation, Axiom 14.15 may be expressed thus: If AB, CD,
EF are concurrent, and DE, FA, BC are concurrent, then AD, BE, CF are concurrent.
3. A particular finite projective plane, in which only 13 “points” and 13 “lines” exist,
can be defined abstractly by calling the points P_i and the lines p_j (i, j = 0, 1, ..., 12) with
the rule that P_i and p_j are “incident” if and only if


i + j ≡ 0, 1, 3 or 9 (mod 13).


Construct a table to indicate the 4 points on each line and the 4 lines through each point
[Veblen and Young 1, p. 6]. Verify that all the axioms are satisfied; for example,
P0P1P2P5 is a complete quadrangle with sides


P0P1 = p0, P0P2 = p1, P1P5 = p8, P0P5 = p9, P2P5 = p11, P1P2 = p12


and diagonal points P3 = p0 · p11, P4 = p9 · p12, P8 = p1 · p8. A possible matrix for
Axiom 14.15 is


P0  P9   P3
P8  P5   P6
P2  P10  P4


The first row may be any set of three collinear points. The second row may be any
such set on a line not incident with a point in the first row. The last row is then
determined; e.g., in the above instance it consists of


P9P6 · P5P3 = P2, P3P8 · P0P6 = P10, P0P5 · P9P8 = P4.


This differs from the general “Pappus matrix” 14.151 in that sets of collinear points
occur not only in the rows and generalized diagonals but also in the columns. In other
words, the 9 points form a configuration which is not merely 9_3 but (9_4, 12_3). When any
one of the 9 points is omitted, the remaining 8 form a self-dual configuration 8_3 which
may be regarded as a pair of mutually inscribed quadrangles (such as P0P9P5P8 and
P2P3P10P6). [Hilbert and Cohn-Vossen 1, pp. 101-102.]
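
The table requested in Ex. 3 can be generated directly from the incidence rule. The Python sketch below is one way to do so; the names `incident`, `join`, `meet` and `pappus_last_row` are introduced here for illustration, and the term "difference set" for {0, 1, 3, 9} is ours rather than the text's. The last function applies Axiom 14.15 to two given rows of a Pappus matrix, using the formulas for A3, B3, C3 given with 14.151.

```python
from itertools import combinations

Q = 13
DIFFERENCE_SET = {0, 1, 3, 9}


def incident(i, j):
    """P_i lies on p_j if and only if i + j is 0, 1, 3 or 9 (mod 13)."""
    return (i + j) % Q in DIFFERENCE_SET


def points_on(j):
    """Indices of the points on the line p_j, in increasing order."""
    return [i for i in range(Q) if incident(i, j)]


def common_lines(a, b):
    return [j for j in range(Q) if incident(a, j) and incident(b, j)]


def common_points(j, k):
    return [i for i in range(Q) if incident(i, j) and incident(i, k)]


def join(a, b):
    """Index of the line through the distinct points P_a and P_b."""
    (j,) = common_lines(a, b)  # fails loudly if the join is not unique
    return j


def meet(j, k):
    """Index of the point common to the distinct lines p_j and p_k."""
    (i,) = common_points(j, k)
    return i


def incidence_axioms_hold():
    """Check 14.11 and 14.121 for every pair of points and of lines."""
    pairs = list(combinations(range(Q), 2))
    return (all(len(common_lines(a, b)) == 1 for a, b in pairs)
            and all(len(common_points(j, k)) == 1 for j, k in pairs))


def pappus_last_row(row1, row2):
    """Third row (A3, B3, C3) determined by rows (A1, B1, C1), (A2, B2, C2)."""
    (a1, b1, c1), (a2, b2, c2) = row1, row2
    a3 = meet(join(b1, c2), join(b2, c1))
    b3 = meet(join(c1, a2), join(c2, a1))
    c3 = meet(join(a1, b2), join(a2, b1))
    return a3, b3, c3
```

Calling `points_on(j)` for j = 0, ..., 12 gives the thirteen rows of the table, and `pappus_last_row((0, 9, 3), (8, 5, 6))` completes a matrix whose first two rows are P0 P9 P3 and P8 P5 P6.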

4. The geometry described in Ex. 3 is known as PG (2, 3). More generally, PG (2, p) 
is a finite plane in which each line contains p + 1 points. Consequently, each point lies
on p + 1 lines. There are p² + p + 1 points (and the same number of lines) altogether. In
other words, the whole geometry is a configuration n_d with n = p² + p + 1 and d = p + 1.
(Actually p is not arbitrary; e.g., although it may be any power of an odd prime, for
instance, 5, 7, or 9, it cannot be 6.)* The possibility of such finite planes indicates
that the projective geometry defined by Axioms 14.11 to 14.15 is not categorical: it is not 
just one geometry but many geometries, in fact, infinitely many. 


5. In any finite projective geometry, Sylvester’s theorem (§ 4.7) is false. 


14.2 PROJECTIVE COORDINATES 


Modern algebra does not seem quite so terrifying when expressed in 
these geometrical terms! 

G. de B. Robinson (1906 - )

[Robinson 1, p. 94] 


We saw, in § 13.7, that three real numbers t1, t2, t3 will serve as barycentric
coordinates for a point in the affine plane (with respect to any given triangle
of reference) if and only if


t1 + t2 + t3 ≠ 0.


Also a linear homogeneous equation 13.72 will serve as the equation for a 
line if and only if the coefficients T1, T2, T3 are not all equal. The remarks


* By not insisting on Axiom 14.14, we can develop a “geometry of characteristic 2” in which p is 
a power of 2. By not insisting on Axiom 14.15, we can develop a “non-Desarguesian plane.” For 
the application to mutually orthogonal Latin squares, see Robinson 1, p. 161, Appendix II.
just after 13.84 indicate that these artificial restrictions will be avoided when
we have extended the real affine plane to the real projective plane by adding
the line at infinity


14.21 t1 + t2 + t3 = 0


and all its points (which are the points at infinity in various directions). 
When we interpret T1, T2, T3 as the distances from A1, A2, A3 to the line


T1 t1 + T2 t2 + T3 t3 = 0,


it is obvious that a parallel line is obtained by adding the same number to
all three T’s. Hence the point of intersection of two parallel lines satisfies
14.21, that is, it lies on the line at infinity. 

To emphasize the fact that, in projective geometry, the line at infinity 
no longer plays a special role, we shall abandon the barycentric coordinates 
(t1, t2, t3) in favor of general projective coordinates (x1, x2, x3), given by


t1 = μ1 x1, t2 = μ2 x2, t3 = μ3 x3,


where μ1, μ2, μ3 are constants, μ1 μ2 μ3 ≠ 0. Thus (x1, x2, x3) is the
centroid of masses μ_α x_α at A_α (α = 1, 2, 3), and the line at infinity has the
undistinguished equation


μ1 x1 + μ2 x2 + μ3 x3 = 0.


The contrast between these two kinds of coordinates may also be ex- 
pressed as follows. Barycentric coordinates can be referred to any given tri- 
angle; the “simplest” points 


(1, 0,0), (0, 1,0), (0, 0, 1) 


are the vertices, and the unit point (1, 1, 1) is the centroid. More usefully, 
projective coordinates can be referred to any given quadrangle! Taking 
three of the four vertices to determine a system of barycentric coordinates, 
suppose the fourth vertex is (μ1, μ2, μ3). By using these μ’s for the transition
to projective coordinates, we give this fourth vertex the new coordinates 
(1, 1, 1). Just as, in affine geometry, all triangles are alike, so in projective 
geometry all quadrangles are alike. 

To prove that projective coordinates provide a model (in the augmented 
affine plane) for the abstract projective plane described in § 14.1, we can take 
each of our geometric axioms and prove it analytically (i.e., algebraically). 

To prove 14.11, we merely have to observe that the line joining points
(y1, y2, y3) and (z1, z2, z3) is

$$\begin{vmatrix} y_2 & y_3 \\ z_2 & z_3 \end{vmatrix} x_1 + \begin{vmatrix} y_3 & y_1 \\ z_3 & z_1 \end{vmatrix} x_2 + \begin{vmatrix} y_1 & y_2 \\ z_1 & z_2 \end{vmatrix} x_3 = 0 \tag{14.22}$$

(cf. 13.74). Similarly, for 14.12 (or rather, 14.121), the point of intersection
of lines ΣY_α x_α = 0 and ΣZ_α x_α = 0 is

$$\left( \begin{vmatrix} Y_2 & Y_3 \\ Z_2 & Z_3 \end{vmatrix}, \ \begin{vmatrix} Y_3 & Y_1 \\ Z_3 & Z_1 \end{vmatrix}, \ \begin{vmatrix} Y_1 & Y_2 \\ Z_1 & Z_2 \end{vmatrix} \right).$$

For 14.13, we can use the four points
14.23 (1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, 1).
The diagonal points of the quadrangle so formed are
(0, 1, 1), (1, 0, 1), (1, 1, 0).
If these three points lay on a line ΣX_α x_α = 0, we should have


14.24 X2 + X3 = 0, X3 + X1 = 0, X1 + X2 = 0,
whence X1 = X2 = X3 = 0, which is absurd. This proves 14.14.
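
The determinants in 14.22 and in the formula for the point of intersection are the components of a vector product, so joins and meets in homogeneous coordinates can both be computed by one routine. The Python sketch below (the names `cross`, `det3`, `join`, `meet` are illustrative) recomputes the diagonal points of the quadrangle 14.23 and tests their collinearity with a 3 × 3 determinant, using integer coordinates.

```python
def cross(u, v):
    """Vector product of two triads; its components are the 2x2
    determinants that appear in 14.22."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def det3(u, v, w):
    """Determinant with rows u, v, w; zero exactly when the three
    points (or three lines) are collinear (or concurrent)."""
    c = cross(v, w)
    return u[0] * c[0] + u[1] * c[1] + u[2] * c[2]


join = cross  # coefficients of the line through two points
meet = cross  # coordinates of the point common to two lines

# The four points 14.23.
A1, A2, A3, C1 = (1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, 1)

# Each diagonal point is the meet of a pair of opposite sides.
diagonal_points = [
    meet(join(A2, A3), join(A1, C1)),
    meet(join(A3, A1), join(A2, C1)),
    meet(join(A1, A2), join(A3, C1)),
]
```

Up to a factor −1 the three points obtained are (0, 1, 1), (1, 0, 1), (1, 1, 0). Their determinant is nonzero but even, which agrees with the proof of 14.14 and indicates that the argument can fail only when 1 + 1 = 0.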


Finally, to prove 14.15 we use the coordinates 14.23 for the four points
A1, A2, A3, C1.

On the lines C1A1, C1A2, C1A3, which are

x2 = x3, x3 = x1, x1 = x2,
we take the points B1, B3, B2 to be

(p, 1, 1), (1, q, 1), (1, 1, r).

The three lines A3B1, A1B3, A2B2, being

x1 = p x2, x2 = q x3, x3 = r x1,
all pass through the same point C2 if
14.25 pqr = 1.
The three lines A3B3, A2B1, A1B2, being

x2 = q x1, x1 = p x3, x3 = r x2,
all pass through the same point C3 if


qpr = 1.

Since this condition agrees with 14.25, the proof is complete. However, 
it is important to observe that the above deduction can be carried through 
in the more general situation where the coordinates belong not to a field 
but to an arbitrary division ring [Birkhoff and MacLane 1, p. 222]. We can 
still speak of points and lines, but Axiom 14.15 will have to be replaced by 
a weaker statement if the coordinate ring includes elements p and q such 


that
pq ≠ qp.


For instance, we might have p = k and q = j in a “quaternion geometry”
whose coordinates are based on “units” i, j, k satisfying


i² = j² = k² = ijk = −1.
When the A’s and B’s are so chosen, 14.15 is false. We have thus
established an important connection between geometry and algebra: Hilbert’s
discovery that, when homogeneous coordinates are used in a plane satisfy- 
ing the first four axioms, Pappus’s theorem is equivalent to the commutative 
law for multiplication. 
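
The quaternion instance can be checked by direct multiplication. In the Python sketch below, a quaternion w + xi + yj + zk is stored as the tuple (w, x, y, z), and `qmul` is the usual Hamilton product. The choice r = i, made so that pqr = 1 when p = k and q = j, is our own completion of the example and is labeled as an assumption.

```python
def qmul(a, b):
    """Hamilton product of quaternions (w, x, y, z) = w + xi + yj + zk."""
    aw, ax, ay, az = a
    bw, bx, by, bz = b
    return (aw * bw - ax * bx - ay * by - az * bz,
            aw * bx + ax * bw + ay * bz - az * by,
            aw * by - ax * bz + ay * bw + az * bx,
            aw * bz + ax * by - ay * bx + az * bw)


ONE = (1, 0, 0, 0)
I = (0, 1, 0, 0)
J = (0, 0, 1, 0)
K = (0, 0, 0, 1)

# p = k and q = j as in the text; r = i is an assumed completion
# chosen so that the condition pqr = 1 is satisfied.
p, q, r = K, J, I

pqr = qmul(qmul(p, q), r)  # the product required to equal 1
qpr = qmul(qmul(q, p), r)  # the product required for the second concurrence
```

Here `pqr` is the unit 1 while `qpr` is −1, so the first three lines are concurrent and the second three are not: the conclusion of Axiom 14.15 fails for these coordinates.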

EXERCISES 


1. Given five points, no three collinear, we can assign the coordinates 14.23 to any 
four of them, and then the coordinates (x1, x2, x3) of the fifth are definite (apart from
the possibility of multiplying all by the same constant). If the mutual ratios of the 
three x’s are rational, we can multiply by a “common denominator” so as to make them 
all integral. In this case we can derive the fifth point from the first four by a linear 
construction, involving a finite sequence of operations of joining two known points or 
taking the point of intersection of two known lines. Devise such a construction for 
the point (1, 2, 3). 

2. The four points (1, ±1, ±1) form a complete quadrangle whose diagonal
triangle is the triangle of reference.

3. A configuration 8_3, consisting of two mutually inscribed quadrangles, exists in
the complex projective plane, but not in the real projective plane. When it does exist,
its eight points appear in four pairs of “opposites” whose joins are concurrent. The
complete figure is a (9_4, 12_3). Hint: Let the two quadrangles be P0P2P4P6 and
P1P3P5P7, so that the sets of three collinear points are


P0P1P3, P1P2P4, P2P3P5, P3P4P6, P4P5P7, P5P6P0, P6P7P1, P7P0P2.


Take P0P1P2 as triangle of reference and let P3, P4, P7 be (1, 1, 0), (0, 1, 1), (1, 0, x).
Deduce that P5 and P6 are (1, 1, x + 1) and (1, x + 1, x). Obtain an equation for x
from the collinearity of P0P5P6.

4. If p is an odd prime, a finite projective plane PG(2, p) can be obtained by
taking the coordinates to belong to the field GF(p) which consists of the p residues (or,
strictly, residue classes) modulo p [Ball 1, pp. 60-61]. For instance, the appropriate 
“finite arithmetic” for PG(2, 3) consists of symbols 0, 1, 2 which behave like ordinary 
integers except that 

1 + 2 = 0 and 2 × 2 = 1.


In the notation of Ex. 3 at the end of § 14.1, take P0P1P2 to be the triangle of reference
and P5 the unit point (1, 1, 1). Find coordinates for the remaining points, and equations
for the lines. 

Finite planes, and the analogous finite n-spaces PG(n, p), were discovered by von 
Staudt* and rediscovered by Fano. Von Staudt took n to be 2 or 3. Fano took p to be 
a prime. The generalization PG(n, p^k) is credited to Veblen and Bussey.


* K. G. C. von Staudt, Beiträge zur Geometrie der Lage, vol. 1 (Nürnberg, 1856), pp. 87-88; Gino
Fano, Giornale di Matematiche, 30 (1892), pp. 114-124; Veblen and Bussey, Transactions of the
American Mathematical Society, 7 (1906), pp. 241-259.

5. Taking the coordinates to belong to GF(2), which consists of the two “numbers”
0 and 1 with the rule for addition
1 + 1 = 0,


we obtain a finite “geometry” in which the diagonal points of a complete quadrangle
are always collinear! Our proof of 14.14 breaks down because now the equations 14.24
have not only the inadmissible solution X1 = X2 = X3 = 0 but also the significant
solution X1 = X2 = X3 = 1, which yields the line


x1 + x2 + x3 = 0.


This PG(2, 2) can be described abstractly by calling its seven points P_i and its seven
lines p_j (i, j = 0, 1, ..., 6) with the rule that P_i and p_j are incident if and only if


i + j ≡ 0, 1 or 3 (mod 7).
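
The claim that every complete quadrangle of PG(2, 2) has collinear diagonal points can be confirmed by exhaustive search in the abstract model. The following Python sketch enumerates all sets of four points, no three collinear, and examines their diagonal points; the helper names are hypothetical.

```python
from itertools import combinations

Q = 7
RESIDUES = {0, 1, 3}


def incident(i, j):
    """P_i lies on p_j if and only if i + j is 0, 1 or 3 (mod 7)."""
    return (i + j) % Q in RESIDUES


def join(a, b):
    """Index of the line through the distinct points P_a and P_b."""
    (j,) = [j for j in range(Q) if incident(a, j) and incident(b, j)]
    return j


def meet(j, k):
    """Index of the point common to the distinct lines p_j and p_k."""
    (i,) = [i for i in range(Q) if incident(i, j) and incident(i, k)]
    return i


def collinear(a, b, c):
    """True if the three distinct points P_a, P_b, P_c lie on one line."""
    return join(a, b) == join(b, c)


def diagonal_points(a, b, c, d):
    """Meets of the three pairs of opposite sides of the quadrangle."""
    return (meet(join(a, b), join(c, d)),
            meet(join(a, c), join(b, d)),
            meet(join(a, d), join(b, c)))


# All complete quadrangles: four points, no three of them collinear.
quadrangles = [quad for quad in combinations(range(Q), 4)
               if not any(collinear(*t) for t in combinations(quad, 3))]

fano_axiom_fails_everywhere = all(
    collinear(*diagonal_points(*quad)) for quad in quadrangles)
```

There are seven such quadrangles, one complementary to each line, and in every case the three diagonal points are the points of that line.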


14.3. DESARGUES’S THEOREM 


The fundamental idea for this pure geometry came from the desire of
Renaissance painters to produce a “visual” geometry. How do things
really look, and how can they be presented on the plane of the drawing?
For example, there will be no parallel lines, since such lines appear
to the eye to converge.

S. H. Gould (1909 - ) 

[Gould 1, p. 298] 


Two triangles, with their vertices named in a particular order, are said to 
be perspective from a point (or briefly, “perspective”) if their three pairs of 
corresponding vertices are joined by concurrent lines. For instance, in
Figure 14.1b, the triangles A1A2A3 and B1B3B2 (sic) are perspective from C1.
By permuting the vertices of B1B3B2 cyclically, either forwards or
backwards, we see that the same two triangles are also perspective from C2 or
C3. In fact, one of the neatest statements of Axiom 14.15 [see Veblen and 
Young 1, p. 100] is: 


If two triangles are doubly perspective they are trebly perspective. 


Dually, two triangles are said to be perspective from a line if their three 
pairs of corresponding sides meet in collinear points. It was observed by 
G. Hessenberg* that our axioms suffice for a proof of 


DESARGUES’S THEOREM. If two triangles are perspective from a point they
are perspective from a line, and conversely. 


The details are as follows. Let two triangles PQR and P’Q’R’ be per- 
spective from O, as in Figure 14.3a, and let their corresponding sides meet 
in points 

$$D = QR \cdot Q'R', \qquad E = RP \cdot R'P', \qquad F = PQ \cdot P'Q'.$$

We wish to prove that D, E, F are collinear. After defining four further
points

$$S = PR \cdot Q'R', \qquad T = PQ' \cdot OR, \qquad U = PQ \cdot OS, \qquad V = P'Q' \cdot OS,$$

we have, in general, enough triads of collinear points to make three applica-
tions of Axiom 14.15. The “matrix” notation enables us to write simply

$$\begin{Vmatrix} O & Q & Q' \\ P & S & R \\ D & T & U \end{Vmatrix}, \qquad
\begin{Vmatrix} O & P & P' \\ Q' & R' & S \\ E & V & T \end{Vmatrix}, \qquad
\begin{Vmatrix} P & Q' & T \\ V & U & S \\ D & E & F \end{Vmatrix}.$$

* Mathematische Annalen, 61 (1905), pp. 161-172.


The last row of the last matrix exhibits the desired collinearity. 
The converse follows by the principle of duality. 
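
As a numerical illustration (a sketch only, not part of the original argument), the theorem can be tested in the homogeneous coordinates of § 14.2 over the integers. The helper names below are hypothetical. Both the join of two points and the meet of two lines are computed as a cross product, and collinearity is tested by a vanishing determinant. With O the unit point and PQR the triangle of reference, the second triangle is the one in Ex. 1 below, with p, q, r = 3, 4, 6.

```python
def cross(u, v):
    """Cross product: the join of two points or the meet of two lines."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def det3(a, b, c):
    """Determinant of three coordinate triples; zero means collinear."""
    x = cross(b, c)
    return a[0] * x[0] + a[1] * x[1] + a[2] * x[2]

def meet_of_joins(a, b, c, d):
    """The point AB . CD."""
    return cross(cross(a, b), cross(c, d))

def on_ray(o, p, k):
    """The point O + kP, which lies on the line OP (hypothetical helper)."""
    return tuple(o[i] + k * p[i] for i in range(3))

def desargues_points(O, P, Q, R, kp, kq, kr):
    """D, E, F for triangles PQR and P'Q'R' perspective from O."""
    P1, Q1, R1 = on_ray(O, P, kp), on_ray(O, Q, kq), on_ray(O, R, kr)
    D = meet_of_joins(Q, R, Q1, R1)
    E = meet_of_joins(R, P, R1, P1)
    F = meet_of_joins(P, Q, P1, Q1)
    return D, E, F

if __name__ == "__main__":
    D, E, F = desargues_points((1, 1, 1), (1, 0, 0), (0, 1, 0), (0, 0, 1), 2, 3, 5)
    print(D, E, F, "determinant:", det3(D, E, F))
```

Integer arithmetic keeps the test exact; the determinant vanishes for any admissible choice of the three multipliers.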


Figure 14.3a


EXERCISES 


1. The triangle (p, 1, 1) (1, q, 1) (1, 1, r) is perspective with the triangle of refer-
ence from the unit point (1, 1, 1). Pairs of corresponding sides meet in the three col-
linear points

(0, q − 1, 1 − r), (1 − p, 0, r − 1), (p − 1, 1 − q, 0).

2. Desargues’s theorem involves 10 points and 10 lines, forming a configuration $10_3$.
To obtain a symmetrical notation, consider triangles $P_{14}P_{24}P_{34}$ and $P_{15}P_{25}P_{35}$, perspec-
tive from a point $P_{45}$ and consequently from a line $P_{23}P_{31}P_{12}$. Then three points $P_{ij}$
are collinear if their subscripts involve just three of the numbers 1, 2, 3, 4, 5. If the
remaining two of the five numbers are k and l, we may call the line $p_{kl}$. Then the
same two triangles may be described as $p_{15}p_{25}p_{35}$ and $p_{14}p_{24}p_{34}$, perspective from the
line $p_{45}$.

3. In the finite projective plane PG(2, 3), the two triangles $P_1P_2P_7$ and $P_3P_8P_4$ are
perspective from the point $P_0$ and from the line $P_9P_{12}P_{10}$. Identify the remaining points
in Figure 14.3a. (In this special geometry, U and V both coincide with F, which is
not surprising in view of the fact that Figure 14.3a involves 14 points whereas the whole
plane contains only 13.)
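
The finite case in Ex. 3 lends itself to a direct computation. The sketch below is an illustration only. It assumes the abstract description of PG(2, 3) from Ex. 3 of § 14.1, namely that $P_i$ and $p_j$ are incident when i + j ≡ 0, 1, 3 or 9 (mod 13); that rule is not restated in this section, and the function names are hypothetical. Points and lines are represented by their subscripts.

```python
N = 13
DIFFERENCE_SET = {0, 1, 3, 9}  # assumed incidence rule: i + j mod 13

def line_points(j):
    """Set of subscripts i such that P_i lies on p_j."""
    return {i for i in range(N) if (i + j) % N in DIFFERENCE_SET}

def join(a, b):
    """Subscript of the unique line through the distinct points P_a, P_b."""
    (j,) = [j for j in range(N) if {a, b} <= line_points(j)]
    return j

def meet(j, k):
    """Subscript of the unique point common to the distinct lines p_j, p_k."""
    (i,) = line_points(j) & line_points(k)
    return i

def desargues_figure(O, P, Q, R, P1, Q1, R1):
    """The points D, E, F, S, T, U, V of the proof in § 14.3."""
    D = meet(join(Q, R), join(Q1, R1))
    E = meet(join(R, P), join(R1, P1))
    F = meet(join(P, Q), join(P1, Q1))
    S = meet(join(P, R), join(Q1, R1))
    T = meet(join(P, Q1), join(O, R))
    U = meet(join(P, Q), join(O, S))
    V = meet(join(P1, Q1), join(O, S))
    return dict(D=D, E=E, F=F, S=S, T=T, U=U, V=V)

if __name__ == "__main__":
    print(desargues_figure(0, 1, 2, 7, 3, 8, 4))
```

Under the stated assumption, the computation returns the same subscript for U, V and F, in agreement with the remark in parentheses.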


14.4 QUADRANGULAR AND HARMONIC SETS 


Desargues’s theorem enables us to prove an important property of a 
quadrangular set of points, which is the section of the six sides of a com-
plete quadrangle by any line that does not pass through a vertex:


14.41 Each point of a quadrangular set is uniquely determined by the re- 
maining points. 

Proof. Let PQRS be a complete quadrangle whose sides PS, QS, RS,
QR, RP, PQ meet a line g (not through a vertex) in six points A, B, C, D,
E, F, certain pairs of which may possibly coincide. (The first three points
come from three sides all containing the same vertex S; the last three from
the respectively opposite sides, which form the triangle PQR.) To show
that F is uniquely determined by the remaining five points, we set up an-
other quadrangle P’Q’R’S’ whose first five sides pass through A, B, C, D, E,
as in Figure 14.4a. Since the two triangles PRS and P’R’S’ are perspective
from the line g, the converse of Desargues’s theorem tells us that they
are also perspective from a point; thus PP’ passes through the point
O = RR’ · SS’. Similarly, the perspective triangles QRS and Q’R’S’ show
that QQ’ passes through this same point O. In fact, all the four lines PP’,
QQ’, RR’, SS’ pass through O, so that PQRS and P’Q’R’S’ are “perspec-
tive quadrangles.” By the direct form of Desargues’s theorem, the triangles
PQR and P’Q’R’, which are perspective from the point O, are also perspec-
tive from the line DE, which is g; that is, the sides PQ and P’Q’ both meet
g in the same point F.


Figure 14.4a


We shall find it convenient to use the symbol 
(AD) (BE) (CF) 


to denote the statement that the six points form a quadrangular set in the 
above manner. This statement is evidently unchanged if we apply any per-
mutation to ABC and the same permutation to DEF. It is also equivalent
to any of 


(AD) (EB) (FC), (DA) (BE) (FC), (DA) (EB) (CF). 


Figure 14.4b


To obtain other permutations we need a new quadrangle. With the exercise
of some ingenuity we can retain two of the four old vertices, say Q
and S. Defining


$$R' = QR \cdot SF, \qquad P' = PS \cdot QC,$$


as in Figure 14.4b, we apply Axiom 14.15 to the hexagon PRQCFS accord-
ing to the scheme


$$\begin{Vmatrix} P & F & Q \\ C & R & S \\ R' & P' & E \end{Vmatrix},$$

with the conclusion that R’P’ passes through E. Just as the quadrangle 
PQRS yields (AD) (BE) (CF), the quadrangle P’QR’S yields (AD) (BE)
(FC). In other words, the statement (AD) (BE) (CF) implies (AD) (BE)
(FC), and hence also 


14.42 (AD) (BE) (CF) implies (DA) (EB) (FC). 
In the important special case (AA) (BB) (CF), which is abbreviated to 
H(AB, CF), 


we say that the four points form a harmonic set, or, more precisely, that F 
is the harmonic conjugate of C with respect to A and B. This means that A 
and B are two of the three diagonal points of a quadrangle while C and F 
lie respectively on the remaining sides, that is, on the sides that pass through 
the third diagonal point. Axiom 14.14 tells us that the harmonic conjugates 
C and F are distinct (except in the degenerate case when they coincide with 
A or B). 
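
The definition translates directly into a construction with joins and meets. The following sketch is an illustration, not the book's own procedure. It assumes homogeneous integer coordinates, an auxiliary point R off the line AB, and a point S on RC; the names `harmonic_conjugate` and `normalize` are hypothetical. It builds a quadrangle PQRS whose sides PS, QR pass through A and QS, RP pass through B, so that PQ meets AB in the required point F.

```python
from math import gcd

def cross(u, v):
    """Cross product: the join of two points or the meet of two lines."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def normalize(p):
    """Divide by the gcd and make the first nonzero coordinate positive."""
    g = gcd(gcd(abs(p[0]), abs(p[1])), abs(p[2]))
    q = [c // g for c in p]
    lead = next(c for c in q if c != 0)
    if lead < 0:
        q = [-c for c in q]
    return tuple(q)

def harmonic_conjugate(A, B, C, R, S):
    """F with H(AB, CF); R is off the line AB and S lies on the line RC."""
    P = cross(cross(A, S), cross(B, R))   # P = AS . BR
    Q = cross(cross(A, R), cross(B, S))   # Q = AR . BS
    return normalize(cross(cross(P, Q), cross(A, B)))  # F = PQ . AB

if __name__ == "__main__":
    A, B, C = (0, 1, 0), (0, 0, 1), (0, 1, 2)
    print(harmonic_conjugate(A, B, C, (1, 0, 0), (1, 1, 2)))
    print(harmonic_conjugate(A, B, C, (1, 1, 1), (2, 3, 4)))
```

Two different choices of the auxiliary points give the same F, as 14.41 requires.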


Figure 14.4c
EXERCISES

1. H(AB, CF) is equivalent to H(BA, CF) or H(AB, FC) or H(BA, FC). 

2. Describe in detail a construction for the harmonic conjugate of C with respect 
to two given points A and B (on a line through C, as in Figure 14.4c). 

3. The harmonic conjugate of (0, 1, λ) with respect to (0, 1, 0) and (0, 0, 1) is
(0, 1, −λ).

4. In PG(2, 3) (see Ex. 3 at the end of § 14.1), every set of four collinear points is a
harmonic set in every order; e.g., $H(P_0P_1, P_3P_9)$, $H(P_0P_3, P_9P_1)$, $H(P_0P_9, P_1P_3)$.

5. In Figure 6.6a, $H(AA', A_1A_2)$. Deduce the metrical definition


$$\frac{AA_1}{A_1A'} = \frac{AA_2}{A'A_2}$$


for a harmonic set. (Hint: Defining E’ as in Ex. 4 at the end of § 6.6, consider the
quadrangle formed by P, E, E’ and the point at infinity on $A_1P$.)


14.5 PROJECTIVITIES 


A range is the set of all points on a line. Dually, a pencil is the set of all 
lines through a point. Ranges and pencils are instances of one-dimensional 
forms. We shall often have occasion to consider a (one-to-one) correspond- 
ence between two one-dimensional forms. The simplest possible corre- 
spondence between a range and a pencil arises when the lines of the pencil 
join the points of the range to another point, so that the range is a section 
of the pencil. The correspondence between two ranges that are sections of 
one pencil by two distinct lines is called a perspectivity; in such a case we 
write 


$$X \doublebarwedge X' \quad \text{or} \quad X \overset{O}{\doublebarwedge} X',$$

meaning that, if X and X’ are corresponding points of the two ranges, their 
join XX’ continually passes through a fixed point O, which we call the cen- 
ter of the perspectivity. There is naturally also a dual kind of perspectivity 
relating pencils instead of ranges. 

The product of any number of perspectivities is called a projectivity. Two 
ranges (or pencils) related by a projectivity are said to be projectively related, 
and we write 


$$X \barwedge X'.$$

For instance, in the circumstances illustrated in Figure 14.5a,


$$ABCD \overset{O}{\doublebarwedge} A_0B_0C_0D_0 \overset{O_1}{\doublebarwedge} A'B'C'D', \qquad ABCD \barwedge A'B'C'D'.$$
Figure 14.5a Figure 14.5b


Analogously, we can define a projectivity relating a range to a pencil, or vice 
versa. 

Given three distinct points A, B, C on a line, and three distinct points 
A’, B’, C’ on another line, we can relate them by a pair of perspectivities in 
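the manner of Figure 14.5b, where the axis (or “intermediary line”) of the
projectivity joins the points


$$B_0 = AB' \cdot BA', \qquad C_0 = AC' \cdot CA',$$

so that

$$ABC \overset{A'}{\doublebarwedge} A_0B_0C_0 \overset{A}{\doublebarwedge} A'B'C'.$$


For each point X on AB we obtain a corresponding point X’ on A’B’ by
joining A to the point $X_0 = A'X \cdot B_0C_0$, so that


$$ABCX \overset{A'}{\doublebarwedge} A_0B_0C_0X_0 \overset{A}{\doublebarwedge} A'B'C'X'.$$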


By Axiom 14.15, the axis $B_0C_0$, being the “Pappus line” of the hexagon
AB’CA’BC’, contains the point BC’ · CB’. Similarly, it contains the point
of intersection of the “cross joins” of any two pairs of corresponding points. 
In particular, we could have derived the same point X’ from a given point 
X by using perspectivities from B’ and B (or any other pair of correspond- 
ing points) instead of A’ and A. 

It can be proved [Baker 1, pp. 62-64; Robinson 1, pp. 24-36] that the 
product of any number of perspectivities can be reduced to such a product 
of two, provided the initial and final ranges are not on the same line. In 
other words, 


14.51 Any projectivity relating ranges on two distinct lines is expressible 
as the product of two perspectivities whose centers are corresponding points (in 
reversed order) of the two related ranges. 


To relate two triads of distinct points ABC and A’B’C’ on one line, we
may use an arbitrary perspectivity $ABC \doublebarwedge A_1B_1C_1$ to obtain a triad on an-
other line, and then relate $A_1B_1C_1$ to A’B’C’ as in 14.51. Hence

14.52 It is possible, by a sequence of not more than three perspectivities,
to relate any three distinct collinear points to any other three distinct collinear 
points. 

A projectivity $X \barwedge X'$ on one line may have one or more invariant points
(such that X = X’). If it has more than two invariant points, it is merely
the identity, $X \barwedge X$. In fact, the above construction for a projectivity


$$ABCX \barwedge ABCX'$$


on one line involves four points on another line such that

$$ABCX \doublebarwedge A_1B_1C_1X_1 \barwedge ABCX'.$$


By 14.51, there is essentially only one projectivity $A_1B_1C_1 \barwedge ABC$. We have
thus proved

THE FUNDAMENTAL THEOREM OF PROJECTIVE GEOMETRY. A projectivity
is determined when three points of one range and the corresponding three points 
of the other are given. 

If a projectivity relating ranges on two distinct lines has an invariant point
A, this point, belonging to both ranges, must be the common point of the 
two lines, as in Figure 14.5c. Let B and C be any other points of the first 
range, B’ and C’ the corresponding points of the second. The fundamental 
theorem tells us that the perspectivity 


$$ABC \overset{O}{\doublebarwedge} AB'C',$$

where O = BB’ · CC’, is the same as the given projectivity $ABC \barwedge AB'C'$.
Hence 
14.53 A projectivity between two distinct lines is equivalent to a perspec- 
tivity if and only if their point of intersection is invariant. 


Figure 14.5c Figure 14.5d


Returning to the notion of a projectivity between ranges on one line 
(i.e., a projective transformation of the line into itself), we recall that, if such 
a transformation is not merely the identity, it cannot have more than two 
invariant points. It is said to be elliptic, parabolic, or hyperbolic according 
as the number of invariant points is 0, 1, or 2. When coordinates are used,
invariant points arise from roots of quadratic equations; thus elliptic pro-
jectivities do not occur in complex geometry, but


$$ABC \barwedge BCA$$

is elliptic in real geometry [Coxeter 2, p. 48].
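
To make the role of the quadratic equation concrete, here is a small sketch under assumptions that are not in the text: points of the real line are given a non-homogeneous coordinate x (with ∞ allowed), and a projectivity is written as x → (ax + b)/(cx + d) with ad − bc ≠ 0. The invariant points then satisfy cx² + (d − a)x − b = 0, whose discriminant is (a + d)² − 4(ad − bc). The function name `classify` is hypothetical.

```python
def classify(a, b, c, d):
    """Classify the real projectivity x -> (a x + b) / (c x + d)."""
    det = a * d - b * c
    if det == 0:
        raise ValueError("ad - bc must be nonzero")
    if b == 0 and c == 0 and a == d:
        return "identity"
    # Discriminant of c x^2 + (d - a) x - b = 0, rewritten via trace and det.
    disc = (a + d) ** 2 - 4 * det
    if disc < 0:
        return "elliptic"      # no real invariant point
    if disc == 0:
        return "parabolic"     # one invariant point
    return "hyperbolic"        # two invariant points

if __name__ == "__main__":
    # With A, B, C at x = 0, 1, infinity (an assumed choice of coordinates),
    # the projectivity ABC -> BCA is x -> 1 / (1 - x).
    print(classify(0, 1, -1, 1))
    print(classify(1, 1, 0, 1))   # x -> x + 1 fixes only infinity
    print(classify(2, 0, 0, 1))   # x -> 2x fixes 0 and infinity
```

For x → 1/(1 − x) the discriminant is negative, which agrees with the statement that this cyclic projectivity is elliptic in real geometry.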


Figure 14.5d (cf. 14.4a) suggests a simple construction for a hyperbolic
projectivity $ABF \barwedge ACE$ in which one of the invariant points is given:


$$ABF \overset{Q}{\doublebarwedge} ASP \overset{R}{\doublebarwedge} ACE.$$

Here S and P may be any two points collinear with A, and then the other
two vertices of the quadrangle are


$$Q = BS \cdot FP, \qquad R = CS \cdot EP.$$


The second invariant point is evidently D, on QR. When the same projec-
tivity is expressed in the form $ADB \barwedge ADC$ (that is, when both invariant
points are given), we have the analogous construction

$$ADB \overset{Q}{\doublebarwedge} AUS \overset{R}{\doublebarwedge} ADC,$$

where U = AS · QD. This can still be carried out if A and D coincide (i.e., if
g passes through the diagonal point U = PS · QR of the quadrangle), in
which case we have the parabolic projectivity


$$AAB \barwedge AAC$$


[Coxeter 2, p. 50].


Figure 14.5e 


An involution is a projectivity of period 2, that is, a projectivity which in- 
terchanges pairs of points. Figure 14.5e is derived from Figure 14.5d by 
adding extra points T, W, Z. We may imagine this figure to have been de- 
rived from any four given collinear points A, C, D, F by taking a point R 
outside their line, letting the joins RA, RD, RC meet an arbitrary line 
through F in T, Q, W, respectively, and then taking Z = AQ · RC. Since

$$ADCF \overset{Q}{\doublebarwedge} ZRCW \overset{A}{\doublebarwedge} QTFW \overset{R}{\doublebarwedge} DAFC,$$

we have


14.54 $ADCF \barwedge DAFC$.


But, by the fundamental theorem, there is only one projectivity $ADC \barwedge DAF$.
Hence, if a projectivity interchanging A and D transforms C into F, it inter- 
changes C and F. In other words, 


14.55 Any projectivity that interchanges two points is an involution. 
Applying the same set of three perspectivities to another point B, we have

$$B \overset{Q}{\doublebarwedge} S \overset{A}{\doublebarwedge} P \overset{R}{\doublebarwedge} E.$$

Since Q(ABC, DEF), we have now proved the theorem of the quadrangu-
lar set:

14.56 The three pairs of opposite sides of a quadrangle meet any line (not 
through a vertex) in three pairs of an involution. 

Combining this with 14.55, we have an alternative proof for 14.42 [Veb- 
len and Young 1, p. 101]. 


Since the involution $ACD \barwedge DFA$ is determined by its pairs AD and CF
(or any other two of its pairs), it is conveniently denoted by


(AD)(CF)


or (DA)(CF) or (CF)(AD), etc. Thus (AD)(BE)(CF) implies that the pair
BE belongs to (AD)(CF), and CF to (AD)(BE), and AD to (BE)(CF). The 
points in a pair are not necessarily distinct. When A = D and B = E, so 
that H(AB, CF), we have the hyperbolic involution (AA)(BB) which inter- 
changes pairs of harmonic conjugates with respect to A and B. Since this 
same involution is expressible as (AA)(CF), 


14.57 If an involution has one invariant point, it has another, and con- 
sists of the correspondence between harmonic conjugates with respect to these 
two points. 


It follows that there is no parabolic involution. 


EXERCISES 
1. Let the lines OA, OB, ..., $O_1A'$, $O_1B'$, ... and $A_0B_0$ in Figure 14.5a be denoted
by a, b, ..., a’, b’, ... and o. Use the principle of duality to justify the notation


$$abcd \overset{o}{\doublebarwedge} a'b'c'd'.$$

2. The harmonic property is invariant under a projectivity: if H(AB, CF) and
$ABCF \barwedge A'B'C'F'$, then H(A’B’, C’F’) [Coxeter 2, p. 23].

3. H(AB, CF) implies H(CF, AB). (Hint: By 14.54, $ACBF \barwedge CAFB$.)

4. Draw a quadrangle and its section, as in Figure 14.5d. Take an arbitrary point
X on g and construct the corresponding point X’ in the hyperbolic projectivity


$$ABF \barwedge ACE.$$

Do the same for $ADB \barwedge ADC$, and draw the modified figure that is appropriate for
the parabolic projectivity $AAB \barwedge AAC$.

5. Two perspectivities cannot suffice for the construction of an elliptic projectivity. 

6. In the notation of Figure 14.4b,

$$ADCF \overset{Q}{\doublebarwedge} AUP'P \overset{E}{\doublebarwedge} DUR'R \overset{S}{\doublebarwedge} DAFC.$$

7. Any projectivity may be expressed as the product of two involutions [Coxeter 2,
p. 54].

8. The projectivities on the line $x_3 = 0$ are the linear transformations


$$\rho x'_1 = c_{11}x_1 + c_{12}x_2,$$
$$\rho x'_2 = c_{21}x_1 + c_{22}x_2,$$


where $c_{11}c_{22} \ne c_{12}c_{21}$. Under what circumstances is such a projectivity (i) parabolic,
(ii) an involution?


14.6 COLLINEATIONS AND CORRELATIONS 


A collineation is a transformation (of the plane) which transforms collinear 
points into collinear points. Thus it transforms lines into lines, ranges into 
ranges, pencils into pencils, quadrangles into quadrangles, and so on. A 
projective collineation is a collineation which transforms every one-dimen- 
sional form projectively. 


14.61 Any collineation that transforms one range into a projectively re- 
lated range is a projective collineation. 

Proof [Bachmann 1, p. 85]. Let the given collineation transform the range 
of points X on a certain line a into a projectively related range of points X’ 
on the corresponding line a’, and let it transform the points Y on another 
line b into corresponding points Y’ on b’. Any perspectivity relating X and Y
will be transformed into a perspectivity relating X’ and Y’. Hence


$$Y \doublebarwedge X \barwedge X' \doublebarwedge Y',$$


so that the collineation induces a projectivity $Y \barwedge Y'$ between the points of
b and b’, as desired. 


It follows that a projective collineation is determined when two corre- 
sponding quadrangles (or quadrilaterals) are given [Coxeter 2, p. 60]. 

A perspective collineation with center O and axis o is a collineation which 
leaves invariant all the lines through O and all the points on o. (By 14.61,
every perspective collineation is a projective collineation.) Following Sophus
Lie (1842-1899), we call a perspective collineation an elation or a homology
according as the center and axis are or are not incident. A harmonic ho-
mology is the special case when corresponding points A and A’, on a line a
through O, are harmonic conjugates with respect to O and o · a. Every pro-
jective collineation of period 2 is a harmonic homology [Coxeter 2, p. 64]. 

We have seen that a collineation is a point-to-point and line-to-line trans- 
formation which preserves incidences. Somewhat analogously, a correla- 
tion is a point-to-line and line-to-point transformation which dualizes inci- 
dences: it transforms points A into lines a’, and lines b into points B’, in 
such a way that a’ passes through B’ if and only if A lies on b. Thus a cor-
relation transforms collinear points into concurrent lines (and vice versa), 
ranges into pencils, quadrangles into quadrilaterals, and so on. A projec- 
tive correlation is a correlation that transforms every one-dimensional form 
projectively. In a manner resembling the proof of 14.61, we can establish 


14.62 Any correlation that transforms one range into a projectively re- 
lated pencil (or vice versa) is a projective correlation. 


It follows that a projective correlation is determined when a quadrangle 
and the corresponding quadrilateral are given [Coxeter 2, p. 66]. 


A polarity is a projective correlation of period 2. In general, a correlation 
transforms a point A into a line a’ and transforms this line into a new point 
A”. When the correlation is of period two, A” always coincides with A and
we can simplify the notation by omitting the prime (’). Thus a polarity re- 
lates A to a, and vice versa. Following J. D. Gergonne (1771-1859), we 
call a the polar of A, and A the pole of a. Clearly, the polars of all the points
on a form a projectively related pencil of lines through A. 

Since a polarity dualizes incidences, if A lies on b, a passes through B. 
In this case we say that A and B are conjugate points, a and b are conjugate 
lines. It may happen that A and a are incident, so that each is self-conjugate.
We can be sure that this does not always happen, for it is easy to prove that 
the join of two self-conjugate points cannot be a self-conjugate line. It is 
slightly harder to prove that no line can contain more than two self-con- 
jugate points [Coxeter 2, p. 68]. The following theorem will be used in 
§ 14.7: 


14.63 A polarity induces an involution of conjugate points on any line that 
is not self-conjugate. 
Proof. On a non-self-conjugate line a, the projectivity $X \barwedge a \cdot x$ (Figure
14.6a) transforms any non-self-conjugate point B into another point C = a · b,
whose polar is AB. The same projectivity transforms C into B. Since it
interchanges B and C, it must be an involution.


Dually, x and AX are paired in the involution of conjugate lines through A. 
Such a triangle ABC, in which each vertex is the pole of the opposite side 
(so that any two vertices are conjugate points, and any two sides are con- 
jugate lines), is said to be self-polar. If P is any point not on a side, its 
polar p does not pass through a vertex, and the polarity may be described
as the unique projective correlation that transforms the quadrangle ABCP
into the quadrilateral abcp. An appropriate symbol, analogous to the sym-
bol (AB)(PQ) for an involution, is


(ABC)(Pp).

Thus any triangle ABC, any point P not on a side, and any line p not through
a vertex, determine a definite polarity (ABC)(Pp), in which the polar x of 
an arbitrary point X can be constructed by simple incidences. As a first 
step towards this construction we need the following lemma:* 


Figure 14.6a 


14.64 If the polars of the vertices of a triangle are distinct from the re- 
spectively opposite sides, they meet these sides in three collinear points. 

Proof. Let APX be a triangle whose sides PX, XA, AP meet the polars
a, p, x of its vertices in points $A_1$, $P_1$, $X_1$, as in Figure 14.6b. The polar of
$X_1 = x \cdot AP$ is, of course, $x_1 = X(a \cdot p)$. Define also the extra points
P’ = a · AP, X’ = a · AX and their polars p’ = A(a · p), x’ = A(a · x).
By 14.54 and the polarity, we have


$$AP'PX_1 \barwedge P'AX_1P \barwedge p'ax_1p \barwedge AX'XP_1.$$


By 14.53, $AP'PX_1 \doublebarwedge AX'XP_1$. Since the center of this perspectivity is
$P'X' \cdot PX = A_1$, the three points $A_1$, $P_1$, $X_1$ are collinear, as desired.
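
Lemma 14.64 can be tried out numerically. The sketch below is an illustration under an assumption borrowed from later in this section (14.67): a polarity is represented by a symmetric nonsingular matrix, and the polar of a point is the product of that matrix with the coordinate triple. The matrix and the triangle are arbitrary choices, and the helper names are hypothetical.

```python
def cross(u, v):
    """Cross product: the join of two points or the meet of two lines."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def det3(a, b, c):
    """Determinant of three coordinate triples; zero means collinear."""
    x = cross(b, c)
    return a[0] * x[0] + a[1] * x[1] + a[2] * x[2]

def polar(m, point):
    """Line coordinates of the polar of a point in the polarity m."""
    return tuple(sum(m[r][s] * point[s] for s in range(3)) for r in range(3))

def chasles_points(m, A, P, X):
    """A_1 = a . PX, P_1 = p . XA, X_1 = x . AP for the triangle APX."""
    a, p, x = polar(m, A), polar(m, P), polar(m, X)
    A1 = cross(a, cross(P, X))
    P1 = cross(p, cross(X, A))
    X1 = cross(x, cross(A, P))
    return A1, P1, X1

if __name__ == "__main__":
    M = ((1, 2, 0), (2, 1, 3), (0, 3, 1))   # symmetric, determinant -12
    pts = chasles_points(M, (1, 0, 0), (0, 1, 0), (1, 1, 1))
    print(pts, "determinant:", det3(*pts))
```

The vanishing determinant confirms, for this sample, that the three intersections lie on one line.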


We are now ready for the construction (Figure 14.6c): 


14.65 In the polarity (ABC)(Pp), the polar of a point X (not on AP, BP,
or p) is the line $X_1X_2$ determined by


$$A_1 = a \cdot PX, \qquad P_1 = p \cdot AX, \qquad X_1 = AP \cdot A_1P_1,$$
$$B_2 = b \cdot PX, \qquad P_2 = p \cdot BX, \qquad X_2 = BP \cdot B_2P_2.$$


Figure 14.6b Figure 14.6c


* This is known as Chasles’s theorem. The proof given in The Real Projective Plane [Coxeter
2, p. 71] suffices for real geometry but not for the more general geometry which is developed
here. Lemma 5.54 of that book is false in the finite geometry PG(2, 3), which admits a quadri-
lateral whose three pairs of opposite vertices $P_1P_2$, $P_3P_6$, $P_5P_9$ are pairs of conjugate points in
the polarity $P_i \to p_i$ although the four sides $P_1P_3P_9$, $P_2P_6P_9$, $P_1P_5P_6$, $P_2P_3P_5$ contain their re-
spective poles $P_0$, $P_7$, $P_8$, $P_{11}$. (The remaining three of the thirteen points in this finite plane are
the diagonal points of the quadrangle $P_0P_7P_8P_{11}$; their joins in pairs are the diagonals of the
quadrilateral $p_0p_7p_8p_{11}$.) See also W. G. Brown, Canadian Mathematical Bulletin, 3 (1960), pp.
221-223.


Proof. By 14.64, the polars a, p, x meet the lines PX, AX, AP in three
collinear points, the first two of which are $A_1$ and $P_1$. Hence x passes
through $X_1 = AP \cdot A_1P_1$. Similarly x passes through $X_2 = BP \cdot B_2P_2$.


In terms of coordinates, a projective collineation is a linear homogene- 
ous transformation 


14.66 $\rho x'_\alpha = \sum c_{\alpha\beta}\, x_\beta$, $\det(c_{\alpha\beta}) \ne 0$,


where the summation is understood to be taken over the repeated index β
(for each value of α). The nonvanishing of the determinant makes it pos-
sible to solve the equations for $x_\beta$ in terms of $x'_\alpha$, so as to obtain the in-
verse collineation. By suitably adjusting the coefficients $c_{\alpha\beta}$, we can trans-
form the particular quadrangle 14.23 into any given quadrangle [Coxeter 2,
p. 197].

Since the product of two correlations (e.g., a polarity and another corre- 
lation) is a collineation, any given projective correlation can be exhibited 
as the product of an arbitrary polarity and a suitable projective collinea- 
tion. The most convenient polarity for this purpose is that in which the line 


$$\sum X_\alpha x_\alpha = 0$$


is the polar of the point $(X_1, X_2, X_3)$. Combining this with the general col-
lineation 14.66, we obtain the correlation that transforms each point (y)
into the line


14.661 $\sum \bigl(\sum c_{\alpha\beta}\, y_\beta\bigr) x_\alpha = 0$,

where again we must have $\det(c_{\alpha\beta}) \ne 0$. In fact, the correlation is associ-
ated with the bilinear equation

$$\sum\sum c_{\alpha\beta}\, x_\alpha y_\beta = 0$$


[cf. Coxeter 2, p. 200].
The correlation is a polarity if it is the same as its inverse, whose equa-
tion, derived by interchanging (x) and (y), is

$$\sum\sum c_{\alpha\beta}\, y_\alpha x_\beta = 0, \quad \text{or} \quad \sum\sum c_{\beta\alpha}\, x_\alpha y_\beta = 0.$$


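Thus a polarity occurs when $c_{\beta\alpha} = \lambda c_{\alpha\beta}$, where λ is the same for all α and β,
so that $c_{\alpha\beta} = \lambda c_{\beta\alpha} = \lambda^2 c_{\alpha\beta}$, $\lambda^2 = 1$, $\lambda = \pm 1$. But we cannot have λ = −1,
as this would make the determinant


$$\begin{vmatrix} 0 & c_{12} & -c_{31} \\ -c_{12} & 0 & c_{23} \\ c_{31} & -c_{23} & 0 \end{vmatrix} = 0.$$


Hence λ = 1, and $c_{\beta\alpha} = c_{\alpha\beta}$. In other words,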


14.67 A projective correlation is a polarity if and only if its matrix of co- 
efficients is symmetric. 


Thus the general polarity is given by 
14.68 $\sum\sum c_{\alpha\beta}\, x_\alpha y_\beta = 0$, $c_{\beta\alpha} = c_{\alpha\beta}$, $\det(c_{\alpha\beta}) \ne 0$,


meaning that the polar of $(y_1, y_2, y_3)$ is 14.661, or that 14.68 is the condi-
tion for points (x) and (y) to be conjugate. Setting $y_\alpha = x_\alpha$, we deduce
the condition


$$\sum\sum c_{\alpha\beta}\, x_\alpha x_\beta = 0,$$


or

$$c_{11}x_1^2 + c_{22}x_2^2 + c_{33}x_3^2 + 2c_{23}x_2x_3 + 2c_{31}x_3x_1 + 2c_{12}x_1x_2 = 0,$$

for the point (x) to be self-conjugate. Hence

14.69 If a polarity admits self-conjugate points, their locus is given by an
equation of the second degree.
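
In a finite geometry the locus in 14.69 can simply be enumerated. The following sketch is an illustration with hypothetical function names. It assumes coordinates taken as residues modulo a prime q, with each point normalized so that its first nonzero coordinate is 1, and it tests the condition 14.68 with y = x for a given symmetric matrix of coefficients.

```python
from itertools import product

def points(q):
    """Normalized points of PG(2, q), q prime: first nonzero coordinate 1."""
    pts = []
    for v in product(range(q), repeat=3):
        nonzero = [c for c in v if c]
        if nonzero and nonzero[0] == 1:
            pts.append(v)
    return pts

def conjugate(c, x, y, q):
    """Condition 14.68: the double sum of c[a][b] x_a y_b vanishes mod q."""
    total = sum(c[a][b] * x[a] * y[b] for a in range(3) for b in range(3))
    return total % q == 0

def self_conjugate_points(c, q):
    """All points that are conjugate to themselves."""
    return [x for x in points(q) if conjugate(c, x, x, q)]

if __name__ == "__main__":
    identity = ((1, 0, 0), (0, 1, 0), (0, 0, 1))
    print(len(points(3)), "points in PG(2, 3)")
    print(self_conjugate_points(identity, 3))
```

With the identity matrix and q = 3 the condition reads $x_1^2 + x_2^2 + x_3^2 \equiv 0 \pmod 3$, which holds exactly when all three coordinates are nonzero.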
EXERCISES 


1. Given the center and axis of a perspective collineation, and one pair of corre- 
sponding points (collinear with the center), set up a construction for the transform X’ 
of any point X [Coxeter 2, p. 62]. 

2. Any two perspective triangles are related by a perspective collineation. 

3. A collineation which leaves just the points of one line invariant is an elation. 

4. An elation with axis 0 may be expressed as the product of two harmonic ho- 
mologies having this same axis o [Coxeter 2, p. 63]. 

5. In PG(2, 3), the transformation $P_i \to P_{i+1}$ (with subscripts reduced modulo 13)
is evidently a collineation of period 13. Is it a projective collineation? Consider also
the transformation $P_i \to P_{3i}$.

6. What kind of collineation is

(i) $x'_1 = x_1$, $x'_2 = x_2$, $x'_3 = c\,x_3$;
(ii) $x'_1 = x_1 + c_1x_3$, $x'_2 = x_2 + c_2x_3$, $x'_3 = x_3$?

7. Use 14.64 to prove Hesse’s theorem: If two pairs of opposite vertices of a com- 
plete quadrilateral are pairs of conjugate points (in a given polarity), then the third 
pair of opposite vertices is likewise a pair of conjugate points. 

8. Give an analytic proof of Hesse’s theorem. (Hint: Apply the condition 14.68 to
the pairs of vertices

(0, 1, ±1), (±1, 0, 1), (1, ±1, 0)

of the quadrilateral $x_1 \pm x_2 \pm x_3 = 0$.)

9. The bilinear equation

$$x_1y_1 + x_2y_2 + x_3y_3 = 0$$

is the condition for (x) and (y) to be conjugate in the polarity (ABC)(Pp), where ABC
is the triangle of reference, P is (1, 1, 1), and p is $x_1 + x_2 + x_3 = 0$. Are there any
self-conjugate points? Consider, in particular, the case when the coordinates are residues
modulo 3.


14.7 THE CONIC 


The three familiar curves which we call the “conic sections” have a
long history. The reputed discoverer was Menaechmus, who flour-
ished about 350 B.C. They attracted the attention of the best of the
Greek geometers down to the time of Pappus of Alexandria.... A
vivid new interest arose in the seventeenth century.... It seems cer-
tain that they will always hold a place in the mathematical curriculum.

J. L. Coolidge (1873-1954)


[Coolidge 1, Preface] 


In the projective plane there is only one kind of conic. The familiar dis- 
tinction between the hyperbola, parabola, and ellipse belongs to affine 
geometry. To be precise, it depends on whether the line at infinity is a 
secant, a tangent, or a nonsecant [Coxeter 2, p. 129]. 


Figure 14.7a


A polarity is said to be hyperbolic or elliptic according as it does or does 
not admit a self-conjugate point. (In the former case it also admits a self- 
conjugate line: the polar of the point.) The self-conjugate point P, whose 
existence suffices to make a polarity hyperbolic, is by no means the only 
self-conjugate point: there is another on every line through P except its polar 
p. To prove this we use 14.63, which tells us that every such line contains 
an involution of conjugate points. By 14.57, this involution, having one in- 
variant point P, has a second invariant point Q, which is, of course, another 
self-conjugate point of the polarity. Thus the presence of one self-con-
jugate point implies the presence of many (as many as the lines through a
point; for example, infinitely many in real or complex geometry). Their
locus is a conic, and their polars are its tangents. This simple definition,
due to von Staudt, exhibits the conic as a self-dual figure: the locus of self- 
conjugate points and also the envelope of self-conjugate lines. 

The reader must bear in mind that there are only two kinds of polarity 
and that there is only one kind of conic. The terminology is perhaps not 
very well chosen: a hyperbolic polarity has many self-conjugate points, form- 
ing a conic; an elliptic polarity has no self-conjugate points at all, but still 
provides a polar for each point and a pole for each line; there is no such 
thing as a “parabolic polarity.” 

A tangent justifies its name by meeting the conic only at its pole, the point 
of contact. Any other line is called a secant or a nonsecant according as it 
meets the conic twice or not at all, that is, according as the involution of 
conjugate points on it is hyperbolic or elliptic. Any two conjugate points 
on a secant PQ, being paired in the involution (PP)(QQ), are harmonic con- 
jugates with respect to P and Q. 

Let PQR be a triangle inscribed in a conic, as in Figure 14.7a. Any line 
c conjugate to PQ is the polar of some point C on PQ. Let RC meet the 
conic again in S. Then C is one of the three diagonal points of the inscribed
quadrangle PQRS. The other two are


A = PS · QR, B = QS · RP.


Their join meets the sides PQ and RS in points $C_1$ and $C_2$ such that $H(PQ, CC_1)$
and $H(RS, CC_2)$. Since $C_1$ and $C_2$ are conjugate to C, the line AB,
which contains them, is c, the polar of C. Similarly BC is the polar of A.
Therefore A and B are conjugate points. These conjugate points are the 
intersections of c with the sides QR and RP of the given triangle. Hence 


SEYDEWITZ’S THEOREM. If a triangle is inscribed in a conic, any line con-
jugate to one side meets the other two sides in conjugate points. 


From this we shall have no difficulty in deducing 


STEINER’S THEOREM. Let lines x and y join a variable point on a conic to 
two fixed points on the same conic; then $x \barwedge y$.

Proof. The tangents p and q, at the fixed points P and Q, intersect in D,
the pole of PQ. Let c be a fixed line through D (but not through P or Q),
meeting x and y in B and A, as in Figure 14.7b. By Seydewitz’s theorem,
BA is a pair of the involution of conjugate points on c. Hence, when the
point x · y varies on the conic,

$$x \doublebarwedge B \barwedge A \doublebarwedge y.$$


Figure 14.7b Figure 14.7c

The following construction for a conic through five given points, no three 
collinear, was discovered by Braikenridge and Maclaurin independently, 


about 1733 [Coxeter 2, p. 91]. Let $A_1$, $B_2$, $C_1$, $A_2$, $B_1$ be the five points, as
in Figure 14.7c; then the conic is the locus of the point


$$C_2 = A_1(z \cdot C_1A_2) \cdot B_1(z \cdot C_1B_2),$$

where z is a variable line through the point $A_1B_2 \cdot B_1A_2$. This is the con-
verse of

PASCAL’S THEOREM. If a hexagon $A_1B_2C_1A_2B_1C_2$ is inscribed in a conic,
the points of intersection of pairs of opposite sides, namely,

$$B_1C_2 \cdot B_2C_1, \qquad C_1A_2 \cdot C_2A_1, \qquad A_1B_2 \cdot A_2B_1,$$

are collinear.


Pascal discovered his famous theorem [Coxeter 2, p. 103] when he was 
only sixteen years old. More than 150 years later, it was dualized (see Fig- 
ure 14.7d): 


BRIANCHON’S THEOREM. If a hexagon is circumscribed about a conic, its
three diagonals are concurrent. 


We saw, in § 8.4, that the familiar conics of Euclidean geometry have equa- 
tions of the second degree in Cartesian coordinates. The same equations 
in affine coordinates remain valid in affine geometry, and yield homogene- 
ous equations of the second degree in barycentric coordinates (§ 13.7) and 
in projective coordinates (§ 14.2). Thus 14.69 serves to reconcile von Staudt’s 
definition of a conic with the classical definitions. In particular, 


$$x_1x_3 = x_2^2$$


is a conic touching the lines $x_3 = 0$ and $x_1 = 0$ at the respective points
(1, 0, 0) and (0, 0, 1). This conic can be parametrized in the form


$$x_1 : x_2 : x_3 = t^2 : t : 1,$$


which exhibits it as the locus of the point of intersection of corresponding
members of the projectively related pencils of lines


$$x_1 = t x_2 \quad \text{and} \quad x_2 = t x_3.$$


Figure 14.7d


If $\det(c_{\alpha\beta}) = 0$, the quadratic form $\sum\sum c_{\alpha\beta} x_\alpha x_\beta$ may be expressible as the 
product of two linear forms $\sum a_\alpha x_\alpha$ and $\sum b_\beta x_\beta$. Accordingly, a pair of lines 
is sometimes regarded as a degenerate conic. In this sense, Axiom 14.15 is 
a special case of Pascal’s theorem. 
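
The parametrization above and Pascal’s theorem can be checked numerically. The following Python sketch is an illustration only, not part of the original argument; the helper names (`conic_point`, `cross`, `det3`, `pascal_points`) are hypothetical. It assumes homogeneous coordinates with integer entries, represents the join of two points and the meet of two lines by a cross product, and tests collinearity by a vanishing determinant.

```python
def cross(u, v):
    """Join of two points, or meet of two lines, in homogeneous coordinates."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def det3(a, b, c):
    """Determinant of three triples; it vanishes exactly when they are collinear."""
    n = cross(b, c)
    return a[0] * n[0] + a[1] * n[1] + a[2] * n[2]


def conic_point(t):
    """Point (t^2, t, 1) of the conic x1*x3 = x2^2 (illustrative helper)."""
    return (t * t, t, 1)


def pascal_points(hexagon):
    """Intersections of opposite sides of the hexagon A1 B2 C1 A2 B1 C2."""
    A1, B2, C1, A2, B1, C2 = hexagon
    A3 = cross(cross(B1, C2), cross(B2, C1))
    B3 = cross(cross(C1, A2), cross(C2, A1))
    C3 = cross(cross(A1, B2), cross(A2, B1))
    return A3, B3, C3


# Six arbitrarily chosen parameter values give an inscribed hexagon.
hexagon = [conic_point(t) for t in (0, 1, 2, 3, 5, -1)]
collinear = det3(*pascal_points(hexagon)) == 0
```

Because all arithmetic is on integers, the collinearity test is exact; any six distinct parameter values could be substituted for the ones chosen here.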


EXERCISES 


1. If a quadrangle is inscribed in a conic, its diagonal points form a self-polar tri- 
angle. The tangents at the vertices of the quadrangle form a circumscribed quadrilat- 
eral whose diagonals are the sides of the same triangle [Coxeter 2, pp. 85, 86]. 

2. Referring to the projectivity $x \barwedge y$ of Steiner’s theorem, investigate the special 
positions of x and y when A or B coincides with D. 

3. If a projectivity between pencils of lines x and y through P and Q has the effect 
$xpd \barwedge ydq$, where d is PQ, the locus of the point x · y is a conic through P and Q whose 
tangents at these points are p and q. (This construction is often used to define a conic; 
see, e.g., Robinson [1, p. 38].) 

4. Of the conics that touch two given lines at given points, those which meet a third 
line (not through either of the points) do so in pairs of an involution [Coxeter 2, p. 90]. 

5. If two triangles are self-polar for a given polarity, their six vertices lie on a conic 
or on two lines [Coxeter 2, p. 93]. 

6. If two triangles have six distinct vertices, all lying on a conic, they are self-polar 
for some polarity [Coxeter 2, p. 94]. 

7. In PG(2, 3) (Ex. 3 at the end of § 14.1), the polarity P; — pi or ( PsPioPi2\ Po Po) 
determines a conic consisting of the four points Po, P7, Ps, Py, and the four lines po, 
Pr Ps, Pir. (Hint: PoP2oPsPr2 x PyP7PsPs x Po P2 Ps P12.) 

8. The equation $x_1^2 + x_2^2 - x_3^2 = 0$ represents a conic for which the triangle of 
reference is self-polar. Verify Pascal’s theorem as applied to the inscribed hexagon 


(0, 1, 1) (0, −1, 1) (1, 0, 1) (−1, 0, 1) (3, 4, 5) (4, 3, 5). 


14.8 PROJECTIVE SPACE 


Our Geometry is an abstract Geometry. The reasoning could be fol- 
lowed by a disembodied spirit who had no idea of a physical point; 
just as a man blind from birth could understand the Electromagnetic 
Theory of Light. 


H. G. Forder [1, p. 43] 


Axiom 14.12 had the effect of restricting the geometry to a single plane. 
If we remove this restriction, we must know exactly what we mean by a 
plane. First we define a flat pencil to be the set of lines joining a range of 
points (on a line) to another point. Then we define a plane to be the set of 
points on the lines of a flat pencil and the set of lines joining pairs of these 
points. Accordingly we replace Axiom 14.12 by three new axioms. The first 
(which may be regarded as a projective version of Pasch’s axiom, 12.27) al- 
lows us to forget the role of a particular flat pencil in the definition of a plane. 
The second enables us to speak of more than one plane. The third (cf. 
12.431) restricts the number of dimensions to three. 


AXIOM 14.81 If A, B, C, D are four distinct points such that AB meets 
CD, then AC meets BD. 


AXIOM 14.82 There is at least one point not in the plane ABC. 


AXIOM 14.83 Any two planes meet in a line. 


We now have a different principle of duality: points, lines and planes cor- 
respond to planes, lines and points (cf. § 10.5). Two intersecting lines, a 
and b, determine a point a · b and a plane ab; these are dual concepts. Two 
lines that do not intersect are said to be skew. The theory of collineations 
and correlations [Coxeter 3, pp. 63-70] is analogous to the two-dimensional 
case, except that the number of self-conjugate points on a line is no longer 
restricted to 0, 1, or 2. In fact, instead of two kinds of polarity we now have 
four: one “elliptic,” having no self-conjugate points, two “hyperbolic,” whose 
self-conjugate points form a quadric (nonruled or ruled), and one, the null 
polarity (or “null system”), in which every point in space is self-conjugate! 

The idea of defining a quadric as the locus of self-conjugate points in a 
three-dimensional polarity (of the second or third kind) is due to von Staudt. 
Another approach, using a two-dimensional polarity in an arbitrary plane 
ω, was devised by F. Seydewitz.* The quadric appears as the locus of the 
point 

PA · Qa, 
where P and Q are fixed points (on the quadric) while A is a variable point 
on ω and a is its polar. This definition allows the quadric to degenerate to 
a cone or a pair of planes. 

To sample the flavor of solid projective geometry, let us consider a few 
typical theorems. Suppose a complete quadrangle PQRS yields a quad- 
rangular set (AD)(BE)(CF) on a line g, as in Figure 14.8a. In another plane 
through g, let the sides of a triangle P’Q’R’ pass through A, B, C, and let 
DP’ meet EQ’ in S’. Theorem 14.42 tells us that S’ lies on R’F. This re- 
mark yields two interesting configurations: one consisting of eight lines (Fig- 
ure 14.8b), and the other of two mutually inscribed tetrahedra. 

Figure 14.8a 

* Archiv für Mathematik und Physik, 9 (1848), p. 158. 


GALLUCCI'S THEOREM. If three skew lines all meet three other skew lines, 
any transversal to the first set of three meets any transversal to the second set. 

Proof. Let the two sets of lines be PQ’, P’Q, RS; PQ, P’Q’, R’S. This 
notation agrees with Figure 14.8a, for, since PS and Q’R’ both pass through 
A, PQ’ meets R’S, and since QS and R’P’ both pass through B, P’Q meets 
R’S. The transversal from R to PQ’ and P’Q is 

RPQ’ · RP’Q = REQ’ · RDP’ = RS’. 

The transversal from R’ to PQ and P’Q’ is 

R’PQ · R’P’Q’ = R’FQ · R’FQ’ = R’F. 


Since S’ lies on R’F, these transversals meet, as desired. 

MÖBIUS'S THEOREM. If the four vertices of one tetrahedron lie respectively 
in the four face planes of another, while three vertices of the second lie in three 
face planes of the first, then the remaining vertex of the second lies in the re- 
maining face plane of the first. 

Proof. Let PQRS’ and P’Q’R’S be the two tetrahedra, with 

P, Q, R, S’, P’, Q’, S 
in the respective planes 
Q’R’S, P’R’S, P’Q’S, P’Q’R’, QRS’, PRS’, PQR, 

as in Figure 14.8a. Since R’S’ passes through F, on PQ, the remaining ver- 
tex R’ lies in the remaining plane PQS’, as desired. 


Changing the notation from 

$$S,\ P,\ Q,\ R,\ P',\ Q',\ R',\ S'$$
to 
$$S,\ S_{14},\ S_{24},\ S_{34},\ S_{23},\ S_{13},\ S_{12},\ S_{1234},$$

we deduce the first of a remarkable “chain” of theorems due to Homersham 
Cox:* 


COX’S FIRST THEOREM. Let $\sigma_1, \sigma_2, \sigma_3, \sigma_4$ be four planes of general position 
through a point S. Let $S_{ij}$ be an arbitrary point on the line $\sigma_i \cdot \sigma_j$. Let $\sigma_{ijk}$ 
denote the plane $S_{ij}S_{ik}S_{jk}$. Then the four planes $\sigma_{234}, \sigma_{134}, \sigma_{124}, \sigma_{123}$ all pass 
through one point $S_{1234}$. 


Clearly, $\sigma_1, \sigma_2, \sigma_3, \sigma_{123}$ are the face planes of the tetrahedron P’Q’R’S, 
while $\sigma_{234}, \sigma_{134}, \sigma_{124}, \sigma_4$ are those of the inscribed-circumscribed tetrahedron 
PQRS’. Let $\sigma_5$ be a fifth plane through S. Then $S_{15}, S_{25}, S_{35}, S_{45}$ are four 
points in $\sigma_5$; $\sigma_{ij5}$ is a plane through the line $S_{i5}S_{j5}$; and $S_{ijk5}$ is the point 
$\sigma_{ij5} \cdot \sigma_{ik5} \cdot \sigma_{jk5}$. By the dual of Cox’s first theorem, the four points $S_{2345}$, 
$S_{1345}, S_{1245}, S_{1235}$ all lie in one plane. Interchanging the roles of $\sigma_4$ and $\sigma_5$, 
we see that $S_{1234}$ lies in this same plane $S_{2345}S_{1345}S_{1245}$, which we naturally 
call $\sigma_{12345}$. Hence 


COX'S SECOND THEOREM. Let $\sigma_1, \dots, \sigma_5$ be five planes of general posi- 
tion through S. Then the five points $S_{2345}, S_{1345}, S_{1245}, S_{1235}, S_{1234}$ all lie in 
one plane $\sigma_{12345}$. 

Adding the extra digits 56 to all the subscripts in the first theorem, we 
deduce 

COX’S THIRD THEOREM. The six planes $\sigma_{23456}, \sigma_{13456}, \sigma_{12456}, \sigma_{12356}, \sigma_{12346}$, 
$\sigma_{12345}$ all pass through one point $S_{123456}$. 

The pattern is now clear: we can continue indefinitely. “Cox’s (d−3)rd 
theorem” provides a configuration of $2^{d-1}$ points and $2^{d-1}$ planes, with d of 
the planes through each point and d of the points in each plane. 

* Quarterly Journal of Mathematics, 25 (1891), p. 67. See also H. W. Richmond, Journal of 
the London Mathematical Society, 16 (1941), pp. 105-112, and Coxeter, Bulletin of the American 
Mathematical Society, 56 (1950), p. 446. When we describe four planes through a point as being 
“of general position,” we mean that their six lines of intersection are all distinct. 
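
The counts in this configuration can be checked by a purely combinatorial sketch. It assumes the labeling suggested by the subscripts: points carry an even number of indices (S, S_ij, S_ijkl, ...), planes carry an odd number (σ_i, σ_ijk, ...), and a point is taken to lie in a plane when the two labels differ in exactly one index. The function name `cox_configuration` is hypothetical, and the code checks only the incidence counts, not the geometric theorems themselves.

```python
from itertools import combinations


def cox_configuration(d):
    """Label the points and planes of the chain built on d planes through S."""
    indices = range(1, d + 1)
    subsets = [frozenset(c)
               for k in range(d + 1)
               for c in combinations(indices, k)]
    points = [s for s in subsets if len(s) % 2 == 0]   # S, S_ij, S_ijkl, ...
    planes = [s for s in subsets if len(s) % 2 == 1]   # sigma_i, sigma_ijk, ...
    # Assumed incidence rule: the labels differ in exactly one index.
    incident = {p: [q for q in planes if len(p ^ q) == 1] for p in points}
    return points, planes, incident


points, planes, incident = cox_configuration(4)
```

For d = 4 this lists the eight points S, S_12, ..., S_34, S_1234 and the eight planes σ_1, ..., σ_4, σ_123, ..., σ_234 of Cox’s first theorem.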


Our next result would be difficult to obtain without using coordinates. 
Since the equation of the general quadric 


$$c_{11}x_1^2 + \dots + c_{44}x_4^2 + 2c_{12}x_1x_2 + \dots + 2c_{34}x_3x_4 = 0$$

has 4 + 6 = 10 terms, a unique quadric $\Sigma = 0$ can be drawn through nine 
points of general position; for, by substituting each of the nine given sets 
of x’s in $\Sigma = 0$, we obtain nine linear equations to solve for the mutual 
ratios of the ten c’s. Similarly, a “pencil” (or singly infinite system) of 
quadrics 

$$\Sigma + \mu\Sigma' = 0$$

can be drawn through eight points of general position, and a “bundle” (or 
doubly infinite system) of quadrics 

$$\Sigma + \mu\Sigma' + \nu\Sigma'' = 0$$

can be drawn through seven points of general position. But, by solving the 
simultaneous quadratic equations 

$$\Sigma = 0, \quad \Sigma' = 0, \quad \Sigma'' = 0$$


for the mutual ratios of the four x’s, we obtain eight points of intersection 
for these three quadrics. Naturally these eight points lie on every quadric 
of the bundle. Hence 


Seven points of general position determine a unique eighth point, such that 
every quadric through the seven passes also through the eighth. 
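
The linear-algebra step behind these counts can be sketched in Python. The sketch below is an illustration under stated assumptions, not the author’s method: it forms one linear equation per point in the ten coefficients of the quadric and finds the solution space by exact Gaussian elimination over the rationals. The names `monomials` and `nullspace` and the nine sample points are hypothetical choices. The unknowns are the coefficients of the monomials $x_ix_j$ ($i \le j$), so an off-diagonal unknown stands for $2c_{ij}$.

```python
from fractions import Fraction
from itertools import combinations_with_replacement


def monomials(x):
    """The ten products x_i * x_j (i <= j) of a point (x1, x2, x3, x4)."""
    return [x[i] * x[j] for i, j in combinations_with_replacement(range(4), 2)]


def nullspace(rows):
    """Basis of the solution space of rows * v = 0, by exact row reduction."""
    m = [[Fraction(v) for v in r] for r in rows]
    ncols = len(m[0])
    pivots = []
    r = 0
    for c in range(ncols):
        p = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if p is None:
            continue                      # no pivot here: a free column
        m[r], m[p] = m[p], m[r]
        pivot = m[r][c]
        m[r] = [v / pivot for v in m[r]]
        for i in range(len(m)):
            if i != r and m[i][c] != 0:
                f = m[i][c]
                m[i] = [a - f * b for a, b in zip(m[i], m[r])]
        pivots.append(c)
        r += 1
    basis = []
    for free in (c for c in range(ncols) if c not in pivots):
        v = [Fraction(0)] * ncols
        v[free] = Fraction(1)
        for row, pc in enumerate(pivots):
            v[pc] = -m[row][free]
        basis.append(v)
    return basis


# Nine sample points (an arbitrary illustrative choice).
nine = [(1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1), (1, 1, 1, 1),
        (1, 2, 3, 4), (1, -1, 2, 1), (2, 1, -1, 3), (1, 3, -2, -1)]
quadrics = nullspace([monomials(p) for p in nine])
```

Each vector in `quadrics` gives a quadric through all nine points. When the points are in general position the list has a single element; passing only eight or seven points yields the larger solution spaces that correspond to the pencil and the bundle.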


This idea of the eighth associated point provides an alternative proof for 
Cox’s first theorem (and therefore also for the theorems of Möbius and 
Gallucci). Let $S_{1234}$ be defined as the common point of the three planes 
$\sigma_{234}, \sigma_{134}, \sigma_{124}$. (The theorem states that $S_{1234}$ lies also on $\sigma_{123}$.) Since the 
plane pairs $\sigma_1\sigma_{234}$, $\sigma_2\sigma_{134}$, $\sigma_3\sigma_{124}$ form three degenerate quadrics through 
the eight points 

$$S,\ S_{14},\ S_{24},\ S_{34},\ S_{23},\ S_{13},\ S_{12},\ S_{1234},$$

these are eight associated points. The first seven belong also to the plane 
pair $\sigma_4\sigma_{123}$. Since $S_{1234}$ does not lie in $\sigma_4$, it must lie in $\sigma_{123}$, as desired. 


The locus of lines meeting three given skew lines is called a regulus. Gal- 
lucci’s theorem shows that the lines meeting three generators of the regulus 
(including the original three lines) form another “associated” regulus, such 
that every generator of either regulus meets every generator of the other. 
The two reguli are the two systems of generators of a ruled quadric. 

Figure 14.8c          Figure 14.8d 

Let $a_1, b_1, c_1, d_1$ be four generators of the first regulus, and $a_2, b_2, c_2, d_2$ 
four generators of the second, as in Figure 14.8c. The three lines 

$$a_3 = b_1c_2 \cdot b_2c_1, \quad b_3 = c_1a_2 \cdot c_2a_1, \quad c_3 = a_1b_2 \cdot a_2b_1$$

evidently form a triangle whose vertices are $a_1 \cdot a_2$, $b_1 \cdot b_2$, $c_1 \cdot c_2$. G. P. 
Dandelin, in 1824, coined the name hexagramme mystique for the skew 
hexagon $a_1b_2c_1a_2b_1c_2$. Taking the section of its sides by a plane δ of gen- 
eral position, he obtained a plane hexagon $A_1B_2C_1A_2B_1C_2$ whose sides 
$A_1B_2, B_2C_1, \dots$ lie in the planes $a_1b_2, b_2c_1, \dots$ (Figure 14.8d). The points 
of intersection of pairs of opposite sides, namely, 

$$A_3 = B_1C_2 \cdot B_2C_1, \quad B_3 = C_1A_2 \cdot C_2A_1, \quad C_3 = A_1B_2 \cdot A_2B_1,$$

each lying in both the planes $a_3b_3c_3$ and δ, are collinear. By allowing $c_2$ to 
vary while the remaining sides of the skew hexagon remain fixed, we see 
from the Braikenridge-Maclaurin construction (which is the converse of 
Pascal’s theorem, Figure 14.7c), that 

The section of a ruled quadric, by a plane of general position, is a conic. 

If δ, instead of being a plane of general position, is the plane $d_1d_2$, the 
vertices of the hexagon $A_1B_2C_1A_2B_1C_2$ lie alternately on $d_2$ and $d_1$, as in 
Axiom 14.15. Thus Pappus’s theorem may be regarded as a “degenerate” 
case of Pascal’s theorem. In fact, instead of assuming Pappus’s theorem 
and deducing Gallucci’s theorem, we could have taken the latter as an axiom 
and deduced the former. Bachmann [1, p. 254] gives a particularly fine fig- 
ure to illustrate this deduction. 


EXERCISES 


1. If a and b are two skew lines and R is a point not on either of them, Ra · Rb is 
the only transversal from R to the two lines. 

2. Any plane through a generator of a ruled quadric contains another generator. 
(Such a plane is a tangent plane.) Any other plane section of the ruled quadric is a 
conic. 

3. If two tetrahedra are trebly perspective they are quadruply perspective (cf. § 14.3). 
More precisely, if $A_1A_2A_3A_4$ is perspective with each of $B_2B_1B_4B_3$, $B_3B_4B_1B_2$, 
$B_4B_3B_2B_1$, it is also perspective with $B_1B_2B_3B_4$. (Hint: Since $A_iB_j$ meets $A_jB_i$, $A_iB_i$ 
must meet $A_jB_j$.) 

4. The four centers of perspective that were implied in Ex. 3 form a third tetra- 
hedron which is perspective with either of the first two from each vertex of the remain- 
ing one. 

5. In the finite space PG(3, 3), which has 4 points on each line, there are altogether 
40 points, 40 planes, and how many lines? 


14.9 EUCLIDEAN SPACE 


The set of lines drawn from the artist's eye to the various points of 
the object . . . constitute the projection of the object and are called 
the Euclidean cone. Then the section of this cone made by the can- 
vas is the desired drawing. .. . Parallel lines in the object converge 
in the picture to the point where the canvas is pierced by the line from 
the eye parallel to the given lines, 


S.H. Gould [1, p. 299] 


The elementary approach to affine space is to regard it as Euclidean space 
without a metric; the elementary approach to projective space is to regard 
it as affine space plus the plane at infinity and then to ignore the special 
role of that plane. It is equally effective to begin with projective space and 
derive affine space by specializing any one plane, calling it the plane at in- 
finity. (This is still, of course, a projective plane.) Each affine concept has 
its projective definition: for example, the midpoint of AB is the harmonic 
conjugate, with respect to A and B, of the point at infinity on AB [Coxeter 
2, p. 119]. We then derive Euclidean space by specializing one elliptic po- 
larity in the plane at infinity, calling it the absolute polarity. Two lines are 
orthogonal if their points at infinity are conjugate in the absolute polarity; 
a line and a plane are orthogonal if the point at infinity on the line is the 
pole of the line at infinity in the plane. A sphere is the locus of the point 
of intersection of a line through one fixed point and the perpendicular plane 
through another; thus it is a special quadric according to Seydewitz’s defi- 
nition. Two segments with a common end are congruent if they are radii 
of the same sphere [Coxeter 2, p. 146]. 

When we use projective coordinates (x1, X2, x3, X4), referred to an arbi- 
trary tetrahedron 


(1, 0, 0, 0) (0, 1, 0, 0) (0, 0, 1, 0) (0, 0, 0, 1), 


it is convenient to take the plane at infinity to be $x_4 = 0$. Any other equa- 
tion becomes an equation in affine coordinates $x_1, x_2, x_3$ by the simple de- 
vice of setting $x_4 = 1$. In affine terms, the tetrahedron of reference for the 
projective coordinates is formed by the origin and the points at infinity on 
the three axes. Finally, we pass from affine space to Euclidean space by 
declaring that two points $(x_1, x_2, x_3)$ and $(y_1, y_2, y_3)$ are in perpendicular 
directions from the origin if they satisfy the bilinear equation 

$$x_1y_1 + x_2y_2 + x_3y_3 = 0,$$
that is, if the points at infinity 
$$(x_1, x_2, x_3, 0) \quad\text{and}\quad (y_1, y_2, y_3, 0)$$

are conjugate in the absolute polarity. 

All the theorems that we proved in § 14.8 remain valid in Euclidean space. 
An interesting variant of Cox’s chain of theorems can be obtained by means 
of the following specialization. Instead of an arbitrary point on the line 
$\sigma_i \cdot \sigma_j$, we take $S_{ij}$ to be the second intersection of this line with a fixed sphere 
through S. Since the sphere is a quadric through the first seven of the eight 
associated points 

$$S,\ S_{14},\ S_{24},\ S_{34},\ S_{23},\ S_{13},\ S_{12},\ S_{1234},$$

it passes through $S_{1234}$ too, and similarly through $S_{1235}$ and the rest of the 
$2^{d-1}$ points. The $2^{d-1}$ planes meet the sphere in $2^{d-1}$ circles, which remain 
circles when we make an arbitrary stereographic projection, as in § 6.9. We 
thus obtain Clifford’s chain of theorems* in the inversive (or Euclidean) 
plane. 


CLIFFORD’S FIRST THEOREM. Let $\sigma_1, \sigma_2, \sigma_3, \sigma_4$ be four circles of general po- 
sition through a point S. Let $S_{ij}$ be the second intersection of the circles $\sigma_i$ 
and $\sigma_j$. Let $\sigma_{ijk}$ denote the circle $S_{ij}S_{ik}S_{jk}$. Then the four circles $\sigma_{234}, \sigma_{134}$, 
$\sigma_{124}, \sigma_{123}$ all pass through one point $S_{1234}$. 


CLIFFORD’S SECOND THEOREM. Let $\sigma_5$ be a fifth circle through S. Then 
the five points $S_{2345}, S_{1345}, S_{1245}, S_{1235}, S_{1234}$ all lie on one circle $\sigma_{12345}$. 


CLIFFORD’S THIRD THEOREM. The six circles $\sigma_{23456}, \sigma_{13456}, \sigma_{12456}, \sigma_{12356}$, 
$\sigma_{12346}, \sigma_{12345}$ all pass through one point $S_{123456}$. 


And so on! 
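
When S is the point at infinity of the inversive plane, the four circles through S become four straight lines, and the first theorem says that the circumcircles of the four triangles they form pass through one point. The Python sketch below checks this special case with exact rational arithmetic. It is an illustration only; the function names (`meet`, `circle`, `on_circle`, `common_point`) and the sample lines are hypothetical, and lines are assumed to be given as coefficient triples (a, b, c) of ax + by + c = 0.

```python
from fractions import Fraction
from itertools import combinations


def meet(l, m):
    """Intersection of two non-parallel lines a*x + b*y + c = 0."""
    (a1, b1, c1), (a2, b2, c2) = l, m
    d = a1 * b2 - a2 * b1
    return (Fraction(b1 * c2 - b2 * c1, d), Fraction(a2 * c1 - a1 * c2, d))


def circle(p, q, r):
    """Coefficients (D, E, F) of x^2 + y^2 + D*x + E*y + F = 0 through p, q, r."""
    (x1, y1), (x2, y2), (x3, y3) = p, q, r
    s1, s2, s3 = x1 * x1 + y1 * y1, x2 * x2 + y2 * y2, x3 * x3 + y3 * y3
    # Subtracting the equation at p from those at q and r eliminates F.
    a11, a12, b1 = x2 - x1, y2 - y1, -(s2 - s1)
    a21, a22, b2 = x3 - x1, y3 - y1, -(s3 - s1)
    det = a11 * a22 - a12 * a21
    D = (b1 * a22 - a12 * b2) / det
    E = (a11 * b2 - a21 * b1) / det
    return (D, E, -(s1 + D * x1 + E * y1))


def on_circle(c, p):
    D, E, F = c
    x, y = p
    return x * x + y * y + D * x + E * y + F == 0


def common_point(lines):
    """Second common point of the four circumcircles, checked on all four."""
    circles = []
    for omitted in range(4):
        tri = [l for i, l in enumerate(lines) if i != omitted]
        circles.append(circle(*[meet(a, b) for a, b in combinations(tri, 2)]))
    # circles[2] and circles[3] share the vertex where lines 0 and 1 meet.
    p = meet(lines[0], lines[1])
    (D1, E1, _), (D2, E2, _) = circles[2], circles[3]
    dx, dy = -(E1 - E2), D1 - D2          # direction of their radical axis
    t = -(2 * (p[0] * dx + p[1] * dy) + D1 * dx + E1 * dy) / (dx * dx + dy * dy)
    m = (p[0] + t * dx, p[1] + t * dy)
    assert all(on_circle(c, m) for c in circles)
    return m


# Four sample lines: y = 0, x = 0, x + y = 1, x + 2y = 3.
lines = [(0, 1, 0), (1, 0, 0), (1, 1, -1), (1, 2, -3)]
point = common_point(lines)   # (-3/17, 12/17) for these lines
```

The second intersection of two circles that already share a rational point is again rational, which is why exact fractions suffice here.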


EXERCISES 


1. Why is the absolute polarity elliptic? 
2. Draw a careful figure for Clifford’s first theorem. 


3. The circumcircles of the four triangles formed by four general lines all pass 
through one point (cf. Ex. 2 at the end of § 5.5). 


4. The circumcenters of the four triangles of Ex. 3 all lie on a circle which passes also 
through the point of concurrence of the four circumcircles [Forder 3, pp. 16-22; Baker 
1, p. 328]. 


* W. K. Clifford, Mathematical Papers (London, 1882), p. 51. Apparently Clifford did not 
state these theorems in their full generality. Instead of circles through S he took $\sigma_1, \sigma_2, \dots$ to 
be straight lines. In other words, he took S to be the point at infinity of the inversive plane. 
Thus his special form of the theorems could have been derived from the configuration of circles 
on the sphere by taking the center of the stereographic projection to be the point S on the sphere 
[Baker 1, p. 133]. 


Absolute geometry 


In the present chapter we shall re-examine the material of some of the 
earlier chapters in the light of the axiomatic approach outlined in Chapter 
12, regarding classical geometry as ordered geometry enriched with the 
axioms of congruence 15.11-15.15, the last of which is a restatement of 1.26. 
Except in §§ 15.6 and 15.8, we shall work in the domain of absolute geometry, 
that is, we shall take care not to assume any form of Euclid’s fifth postu- 
late. Accordingly, our results will be valid not only in Euclidean geometry 
but also in the non-Euclidean geometry of Gauss, Lobachevsky, and Bolyai. 

In § 15.4 we shall give a simple account of the complete enumeration of 
finite groups of isometries. According to Weyl [1, p. 79], “This is the mod- 
ern equivalent to the tabulation of the regular polyhedra by the Greeks.” 
The relevance of these kinematical results to crystallography makes it na- 
tural, in § 15.6, to reintroduce the full machinery of Euclidean geometry. 
But in § 15.7 we shall return to absolute geometry for a discussion of finite 
groups generated by reflections. Many of the methods used remain valid 
also in spherical geometry. 


15.1 CONGRUENCE 


Every teacher certainly should know something of non-Euclidean ge- 
ometry. . . . It forms one of the few parts of mathematics which . . . 
is talked about in wide circles, so that any teacher may be asked about 
it at any moment. 


F. Klein [2, p. 135] 


To give a rigorous approach to absolute geometry, we begin with ordered 
geometry (Chapter 12) and introduce congruence as a third primitive concept: 
an undefined equivalence relation among point pairs (or segments, or inter- 
vals). We use the notation AB ≡ CD to mean “AB is congruent to CD.” 
The following axioms are those of Pasch with some refinements due to Hil- 
bert and R. L. Moore [see Kerékjártó 1, pp. 90-101]. 

Axioms of Congruence 


15.11 If A and B are distinct points, then on any ray going out from C 
there is just one point D such that AB ≡ CD. 


15.12 If AB ≡ CD and CD ≡ EF, then AB ≡ EF. 

15.13 AB ≡ BA. 


15.14 If [ABC] and [A’B’C’] and AB ≡ A’B’ and BC ≡ B’C’, then 
AC ≡ A’C’. 

15.15 If ABC and A’B’C’ are two triangles with BC ≡ B’C’, CA ≡ C’A’, 
AB ≡ A’B’, while D and D’ are two further points such that [BCD] and [B’C’D’] 
and BD ≡ B’D’, then AD ≡ A’D’. 

By two applications of 15.13, we have AB ≡ AB; that is, congruence is re- 
flexive. From 15.11 and 15.12 we easily deduce that the relation AB ≡ CD 
implies CD ≡ AB; that is, congruence is symmetric. Axiom 15.12 itself says 
that congruence is transitive. Hence congruence is an equivalence relation. 
This result, along with the additive property of 15.14, provides the basis for a 
theory of length [Forder 1, p. 95]. Axiom 15.15 enables us to extend the re- 
lation of congruence from point pairs or segments to angles [Forder 1, p. 132]. 

We follow Euclid in defining a right angle to be an angle that is congruent 
to its supplement; and we agree to measure angles on such a scale that the 
magnitude of a right angle is $\tfrac{1}{2}\pi$. 

The statement AB ≡ CD for segments is clearly equivalent to the state- 
ment AB = CD for lengths, so no confusion arises from using the same sym- 
bol AB for a segment and its length. A similar remark applies to angles. 

The circle with center O and radius r is defined as the locus of a variable 
point P such that OP = r. A point Q such that OQ > r is said to be out- 
side the circle. Points neither on nor outside the circle are said to be inside. 
It can be proved [Forder 1, p. 131] that if a circle with center A has a point in- 
side and a point outside a circle with center C, then the two circles meet in 
just one point on each side of the line AC. Euclid’s first four postulates 
may now be treated as theorems, and we can prove all his propositions as 
far as I.26; also I.27 and 28 with the word “parallel” replaced by “noninter- 
secting.” We can define reflection as in § 1.3, and derive its simple conse- 
quences such as pons asinorum (Euclid I.5) and the symmetry of a circle 
about its diameters (III.3; see § 1.5). But we must be careful to avoid any ap- 
peal to our usual idea about the sum of the angles of a triangle; for example, 
we can no longer assert that angles in the same segment of a circle are equal 
(Euclid III.21). Lacking such theorems as VI.2-4, which depend on the af- 
fine properties of parallelism, we have to look for some quite different way 
to prove the concurrence of the medians of a triangle.* On the other hand, 
the concurrence of the altitudes (of an acute-angled triangle) arises as a by- 
product of Fagnano’s problem, which can still be treated as in § 1.8. (Fer- 
mat’s problem would require a different treatment because we can no longer 
assume the angles of an equilateral triangle to be π/3.) 
* Rarhmann 1 nn T4746 
EXERCISES 


1. Complete the proof that congruence is symmetric: if AB ≡ CD then CD ≡ AB. 

2. How much of § 1.5 remains valid in absolute geometry? [Kerékjártó 1, pp. 161- 
163.] (See especially Exercises 1 and 3.) 


3. For any simple quadrangle inscribed in a circle, the sum of two opposite angles 
is equal to the sum of the remaining two angles [Sommerville 1, p. 84]. 


15.2 PARALLELISM 


I have resolved to publish a work on the theory of parallels as soon as I 
have put the material in order. . . . The goal is not yet reached, but 
I have made such wonderful discoveries that I have been almost over- 
whelmed by them. . . . I have created a new universe from nothing. 


János Bolyai (1802-1860) 
(From a letter to his father in 1823) 


Following Gauss, Bolyai, and Lobachevsky, we say that two lines are 
parallel if they “almost meet.” For the precise meaning of this phrase, see 
§ 12.6. (We use the notation $p_1$ for one of the two rays into which the line 
p is decomposed by a point that lies on it.) 

The idea of the incenter (§ 1.5) may be extended from a triangle to the 
figure formed by two parallel lines and a transversal, enabling us to prove 
that parallelism is symmetric: 


Figure 15.2a 


15.21 If $p_1$ is parallel to $q_1$, then $q_1$ is parallel to $p_1$. 


Proof [Sommerville 1, p. 32]. If $p_1$, through A, is parallel to $q_1$, through 
B, as in Figure 15.2a, the internal bisector AD of the angle at A completes a 
triangle ABD. Let the internal bisector of B meet AD in I. Draw perpen- 
diculars IJ, IK, IL to $p_1$, AB, $q_1$. Reflecting in IA and IB, we see that IJ 
= IK = IL. Let $r_1$ be the internal bisector of ∠LIJ. Reflection in the line 
r interchanges J and L, and therefore interchanges p and q. Since p is paral- 
lel to q, it follows that q is parallel to p in the same sense, that is, $q_1$ is parallel 
to $p_1$. (In the terminology of Gauss, J and L are corresponding points on the 
two parallel rays.) 


We can now use the methods of ordered geometry to prove that paral- 
lelism is transitive: 


15.22 If $p_1$ is parallel to $q_1$, and $q_1$ is parallel to $r_1$, then $p_1$ is parallel 
to $r_1$. 

Proof [Gauss 1, vol. 8, pp. 205-206]. We have to show that, if $p_1$ and $r_1$ 
are both parallel to $q_1$, they are parallel to each other. We see at once that $p_1$ 
and $r_1$ cannot meet; for if they did, we would have two intersecting lines 
p and r both parallel to q in the given sense. By Theorem 12.64, we may 
assume that $p_1$, $q_1$, $r_1$ begin from three collinear points A, B, C. For the 
rest of the proof we distinguish the case in which B lies between A and C 
from the case in which it does not. 


Figure 15.2b          Figure 15.2c 


If [ABC], as in Figure 15.2b, any ray from A within the angle between AC 
and $p_1$ meets $q_1$ (since $p_1$ is parallel to $q_1$) and then meets $r_1$ (since $q_1$ is paral- 
lel to $r_1$). Therefore $p_1$ is parallel to $r_1$. 

If B is not between A and C, suppose for definiteness that [ACB], as in 
Figure 15.2c. Any ray from A within the angle between AC and $p_1$ meets 
$q_1$, say in D. Since r separates A from D, it meets the segment AD. There- 
fore $p_1$ is parallel to $r_1$. 

In this second part of the proof we have not used the parallelism of $q_1$ and 
$r_1$. In fact, 


15.23 If a ray $r_1$ lies between two parallel rays, it is parallel to both. 


Having proved that parallelism is an equivalence relation, we consider the 
set of lines parallel to a given ray. We naturally call this a pencil of parallels, 
since it contains a unique line through any given point [Coxeter 2, p. 5]. 
Pursuing its analogy with an ordinary pencil (consisting of all the lines 
through a point), we may also call it a point at infinity or, following Hilbert, 
an end. Instead of saying that two rays (or lines) are parallel, or that they be- 
long to a certain pencil of parallels M, we say that they have M for a common 
end. In the same spirit, the ray through A that belongs to the given pencil of 
parallels is denoted by AM, as if it were a segment; the same symbol AM can 
also be used for the whole line. 

Let AM, BM be parallel rays, and ε an arbitrarily small angle. Within the 
angle BAM (Figure 15.2d), take a ray from A making with AM an angle 
less than ε. This ray cuts BM in some point C. On CM (which is C/B), 
take D so that CD = CA. The isosceles triangle CAD yields 

∠ADC = ∠CAD < ∠CAM < ε. 

Hence, when BD tends to infinity, so that AD tends to the position AM, 
∠ADB tends to zero. 
This conclusion motivates the following assertion of Bolyai [1, p. 207]: 


15.24 When two parallel lines are regarded as meeting at infinity, the 
angle of intersection must be considered as being equal to zero. 


Figure 15.2d          Figure 15.2e 


When AM and BM are parallel rays, we call the figure ABM an asymptotic 
triangle. Such triangles behave much like finite triangles. In particu- 
lar, two of them are congruent if they agree in the finite side and one angle 
[Carslaw 1, p. 49]: 


15.25 If two asymptotic triangles ABM, A’B'M’ have AB = A'B’ and 
A = A’, then also B = B’. 


It is a consequence of Axiom 15.11 that, if two lines have a common per- 
pendicular, they do not intersect. The following theorem provides a kind 
of converse for this statement. 


15.26 If two lines are neither intersecting nor parallel, they have a com- 
mon perpendicular. 

Proof. From A on the first line AL, draw AB perpendicular to the sec- 
ond line BM, as in Figure 15.2e. If AB is perpendicular to AL there is no 
more to be said. If not, suppose L is on that side of AB for which ∠BAL 
is acute. Since the two lines are neither intersecting nor parallel, there is a 
smaller angle BAM such that AM is parallel to BM. If [BCD] on BM, we 
can apply Euclid I.16 to the triangle ACD, with the conclusion that the in- 
ternal angle at D is less than the external angle at C. Hence, when BD in- 
creases from 0 to ∞, so that ∠DAL decreases from ∠BAL to ∠MAL, 
∠ADB decreases from a right angle to zero. At the beginning of this process 
we have 

∠DAL < ∠ADB 

(since ∠BAL is acute); but at the end the inequality is reversed (since 
∠MAL is positive). Hence there must be some intermediate position for 
which 

∠DAL = ∠ADB. 

(To be precise, we can apply Dedekind’s axiom 12.51 to the points on BM 
satisfying the two opposite inequalities.) For such a point D (Figure 15.2f) 
we obtain two triangles OAE, ODF by drawing EF perpendicular to BD 
through O, the midpoint of AD. Since these triangles are congruent, EF is 
perpendicular not only to BD but also to AL. 


Nonintersecting lines that are not parallel are said to be ultraparallel (or 
“hyperparallel”). We are not asserting the existence of such lines, but merely 
showing how they must behave if they do exist. 


Figure 15.2f 


EXERCISES 


1. Prove 15.25 without referring to Carslaw 1. 

2. Give a complete proof that, if two lines have a common perpendicular, they do 
not intersect. 

3. Example 4 on p. 16 remains valid when A is an end so that the triangle is asymptotic. 


15.3 ISOMETRY 


Beside the actual universe I can set in imagination other universes in 
which the laws are different. 


J. L. Synge [2, p. 21] 


The whole theory of finite groups of isometries (§§ 2.3-3.1) belongs to 
absolute geometry, because it is concerned with isometries having at least 
one invariant point. The first departure from our previous treatment (§ 3.2) 
is in the discussion of isometries without invariant points. We must now 
distinguish between a translation, which is the product of half-turns about 
two distinct points, and a parallel displacement, which is the product of re- 
flections in two parallel lines. 

The product of half-turns about two distinct points O, O’ is a translation 
along a given line (called the axis of the translation) in a given sense through 
a given distance, namely, along OO’ in the sense of the ray O’/O through 
the distance 2OO’. Since a translation is determined by its axis and directed 
distance, the product of half-turns about O, O’ is the same as the product 
of half-turns about Q, Q’, provided the directed segment QQ’ is congruent 
to OO’ on the same line (Figure 3.2a). If P is on this line, the distance $PP^T$ 
(from P to its image under the translation T) is just twice OO’. (If not, it 
may be greater!) 
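
In the Euclidean special case the statement can be checked in coordinates, where the half-turn about a point C sends P to 2C − P. The following Python sketch is a Euclidean illustration only; it does not model the hyperbolic case, so it cannot exhibit the remark that the displacement of a point off the axis may be greater. The names `half_turn` and `compose` and the sample points are hypothetical.

```python
def half_turn(center):
    """Euclidean half-turn about center: P -> 2*center - P."""
    cx, cy = center
    return lambda p: (2 * cx - p[0], 2 * cy - p[1])


def compose(first, second):
    """Apply first, then second."""
    return lambda p: second(first(p))


O, O_prime = (0, 0), (3, 1)
T = compose(half_turn(O), half_turn(O_prime))

# Another pair Q, Q' on the line OO', with QQ' equal to OO' as a directed segment.
Q, Q_prime = (-6, -2), (-3, -1)
T_again = compose(half_turn(Q), half_turn(Q_prime))

image = T((5, 7))   # (5, 7) is displaced by twice the vector from O to O'
```

Both products displace every point by twice the vector from O to O’, so the two pairs of half-turns give the same translation, as the text asserts.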

By the argument used in proving 3.21, the product of two translations 
with the same axis, or with intersecting axes, is a translation. (It is only in 
the former case that we can be sure of commutativity.) More precisely, we 
have 


15.31 (Donkin’s theorem*) The product of three translations along the 
directed sides of a triangle, through twice the lengths of these sides, is the 
identity. 


We shall see later that the product of two translations with nonintersect- 
ing axes may be a rotation. 

By the argument used in proving 3.22, if two lines have a common per- 
pendicular, the product of reflections in them is a translation along this com- 
mon perpendicular through twice the distance between them. (Such lines 
may be either parallel or ultraparallel according to the nature of the geom- 
etry.) 

Again, as in 3.13, every isometry is the product of at most three reflec- 
tions. If the isometry is direct, the number of reflections is even, namely 2. 
It follows from 15.26 that 


15.32 Every direct isometry (of the plane) with no invariant point is either 
a parallel displacement or a translation. 


It is remarkable that absolute geometry includes the whole theory of glide 
reflection. The only changes needed in the previous treatment (§ 3.3) are 
where the word “parallel” was used. (In Figure 3.3b we must define m, m’ 
as being perpendicular to OO’; they are not necessarily parallel to each 
other.) As an immediate application of these ideas we have Hjelmslev’s 
theorem, which is one of the best instances of a genuinely surprising result 
belonging to absolute geometry. The treatment in § 3.6 remains valid with- 
out changing a single word! 

Likewise, the one-dimensional groups of § 3.7 belong to absolute geometry,
the only change being that again the mirrors m, m’ (Figure 3.7b) should 
not be said to be “parallel” but both perpendicular to the same (horizontal) 
line. On the other hand, the whole theory of lattices (Chapter 4) and of 
similarity (Chapter 5) must be abandoned. 

The extension of absolute geometry from two dimensions to three presents
no difficulty. In particular, much of the Euclidean theory of isometry (§ 7.1)
remains valid in absolute space. It is still true that every direct isometry
is the product of two half-turns, and that every opposite isometry with
an invariant point is a rotatory inversion (possibly reducing to a reflection
or to a central inversion). Moreover, the classical enumeration of the five
Platonic solids (§§ 10.1-10.3) is part of absolute geometry. The few necessary
changes are easily supplied; for example, the term rectangle must be
interpreted as meaning a quadrangle whose angles are all equal (though not
necessarily right angles), and a square is the special case when also the sides
are equal.


*W. F. Donkin, On the geometrical theory of rotation, Philosophical Magazine (4), 1 (1851),
187-192. Lamb [1, p. 6] used half-turns about the vertices A, B, C of the given triangle to
construct three new triangles which, he said, “are therefore directly equal to one another, and
‘symmetrically’ equal to ABC.” This was a mistake: all four triangles are directly congruent!


EXERCISES 


1. If l is a line outside the plane of a triangle ABC, what can be said about the three
lines in which this plane meets the three planes Al, Bl, Cl? (If two of the three lines
intersect, or are parallel, or have a common perpendicular, the same can be said of all
three. This property of three lines m1, m2, m3 is equivalent to R1R2R3 = R3R2R1 in
the notation of § 3.4.)

2. The product of reflections in the lines p and r of Figure 15.2a is a parallel dis- 
placement which transforms J into L. 


15.4 FINITE GROUPS OF ROTATIONS 


These groups, in particular the last three, are an immensely attractive 
subject for geometric investigation. 


H. Weyl [1, p. 79] 


One of the simplest kinds of transformation is a permutation (or rearrangement)
of a finite number of named objects. For instance, one way to permute
the six letters a, b, c, d, e, f is to transpose (or interchange) a and b,
to change c into d, d into e, e into c, and to leave f unaltered. This permutation
is denoted by (a b)(c d e). The two “independent” parts, (a b) and
(c d e), are called cycles of periods 2 and 3. A permutation that consists of
just one cycle is said to be cyclic. Clearly, the cyclic group C_n may be represented
by the powers of the generating permutation (a_1 a_2 ... a_n); for instance,
the four elements of C4 are


1, (a b c d), (a c)(b d), (a d c b).

A cyclic permutation of period 2, such as (a b), is called a transposition.

Since

$$(a_1 a_2 \dots a_n) = (a_1 a_n)(a_2 a_n) \cdots,$$

any permutation may be expressed as a product of transpositions. A permutation
is said to be even or odd according to the parity of the number of
cycles of even period; for instance, (a c)(b d) is even, but (a b)(c d e) is odd.
The identity, 1, has no cycles at all, and is accordingly classified as an even
permutation. It is easily proved [see Coxeter 1, pp. 40-41] that every product
of transpositions is even or odd according to the parity of the number
of transpositions. It follows that the multiplication of even and odd permutations
behaves like the addition of even and odd numbers; for example,
the product of two odd permutations is even.
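
The parity rule can be tried out mechanically. The following is a small sketch, assuming permutations stored as Python dictionaries; the helper names (`from_cycles`, `cycles`, `is_even`, `compose`) are our own and purely illustrative. It decomposes a permutation into cycles, applies the rule "even iff the number of cycles of even period is even," and multiplies permutations from left to right.

```python
def from_cycles(letters, *cycs):
    """Build a permutation (dict: object -> image) from disjoint cycles."""
    perm = {x: x for x in letters}
    for c in cycs:
        for i, x in enumerate(c):
            perm[x] = c[(i + 1) % len(c)]
    return perm

def cycles(perm):
    """Decompose a permutation into its cycles of period greater than 1."""
    seen, result = set(), []
    for start in perm:
        cycle, x = [], start
        while x not in seen:
            seen.add(x)
            cycle.append(x)
            x = perm[x]
        if len(cycle) > 1:
            result.append(tuple(cycle))
    return result

def is_even(perm):
    """Even iff the number of cycles of even period is even."""
    return sum(len(c) % 2 == 0 for c in cycles(perm)) % 2 == 0

def compose(first, second):
    """Apply `first`, then `second`."""
    return {x: second[first[x]] for x in first}

letters = "abcdef"
s = from_cycles(letters, "ab", "cde")   # (a b)(c d e), odd
t = from_cycles(letters, "ac", "bd")    # (a c)(b d), even
print(cycles(s))
print(is_even(s), is_even(t), is_even(compose(s, s)), is_even(compose(s, t)))
```

The last line illustrates the analogy with addition of even and odd numbers: odd times odd is even, odd times even is odd.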

It follows also that every group of permutations either consists entirely
of even permutations or contains equal numbers of even and odd permutations.
The group of all permutations of n objects is called the symmetric
group of order n! (or of degree n) and is denoted by S_n. The subgroup consisting
of all the even permutations is called the alternating group of order
½n! (or of degree n) and is denoted by A_n. In particular, S2 is the same
group as C2, and A3 the same as C3, so we write


S2 ≅ C2, A3 ≅ C3.


More interestingly, S3 ≅ D3 (see Figure 2.7a). For, the six elements of 
the dihedral group D3, being symmetry operations of an equilateral triangle, 
may be regarded as permutations of the three sides of the triangle. The 
even permutations 


1, (a b c), (a c b)


(which form the subgroup A3 ≅ C3) are rotations, whereas the odd permutations

(b c), (c a), (a b)


are reflections in the three medians. If we regard the triangle as lying in 
three-dimensional (absolute) space, the rotations are about an axis through 
the center of the triangle, perpendicular to its plane. The reflections may 
then be interpreted in two alternative ways, yielding two groups which are 
geometrically distinct but abstractly identical or isomorphic: we may either 
reflect in three planes through the axis or rotate through half-turns about 
the medians themselves. In the latter representation, all the six elements 
of D3 appear as rotations. We may describe this as the group of direct symmetry
operations of a triangular prism. More generally, the 2n direct symmetry
operations of an n-gonal prism form the dihedral group D_n, whereas
of course the n direct symmetry operations of an n-gonal pyramid form
the cyclic group C_n. The rotations of C_n all have the same axis, and D_n is
derived from C_n by adding half-turns about n lines symmetrically disposed
in a plane perpendicular to that axis.

We have thus found two infinite families of finite groups of rotations.
Other such groups are the groups of direct symmetry operations of the five
Platonic solids {p, q}. These are only three groups, not five, because any
rotation that takes {p, q} into itself also takes the reciprocal {q, p} into itself:
the octahedron has the same group of rotations as the cube, and the
icosahedron the same as the dodecahedron.

The regular tetrahedron {3, 3} is evidently symmetrical by reflection in
the plane that joins any edge to the midpoint of the opposite edge. As a
permutation of the four faces a, b, c, d (Figure 15.4a), this reflection is just
a transposition. Thus the complete symmetry group of the tetrahedron,
being generated by such reflections, is isomorphic to the symmetric group
S4, which is generated by transpositions; and the rotation group, being generated
by products of pairs of reflections, is isomorphic to the alternating
group A4, which is generated by products of pairs of transpositions. The
12 rotations may be counted as follows. The perpendicular from a vertex
to the opposite face is the axis of a trigonal rotation (i.e., a rotation of period 3);
the 4 vertices yield 8 such rotations. The line joining the midpoints
of two opposite edges is the axis of a half-turn (or digonal rotation); the 3
pairs of opposite edges yield 3 such half-turns. Including the identity, we
thus have 8 + 3 + 1 = 12 rotations. As permutations, the 8 trigonal rotations
are


(b c d), (b d c), (a c d), (a d c), (a b d), (a d b), (a b c), (a c b)

and the 3 half-turns are


(b c)(a d), (c a)(b d), (a b)(c d).


Figure 15.4a


Figure 15.4b 


The octahedron {3, 4} can be derived from the tetrahedron by truncation: 
its eight faces consist of the four vertex figures of the tetrahedron and trun- 
cated versions of the four faces. Every symmetry operation of the tetra- 
hedron is retained as a symmetry operation of the octahedron, but the octa- 
hedron also has symmetry operations that interchange the two sets of four 
faces. For instance, the line joining two opposite vertices is the axis of a 
tetragonal rotation (of period 4), and the line joining the midpoints of two 
opposite edges is the axis of a half-turn. When the four pairs of opposite
faces are marked a, b, c, d, as in Figure 15.4b, such a half-turn appears as a
transposition, which is one of the permutations that belong to S4 but not
to A4. It follows that the rotation group of the octahedron (or of the cube)
is isomorphic to the symmetric group S4.


Figure 15.4c

In Figure 15.4c, the twenty faces of the icosahedron {3, 5} have been
marked a, b, c, d, e in sets of four, in such a way that two faces marked alike
have nothing in common, not even a vertex. In fact, the four a’s (for instance)
lie in the planes of the faces of a regular tetrahedron, and the respectively
opposite faces (marked b, c, d, e) form the reciprocal tetrahedron.
The twelve rotations of either tetrahedron into itself (represented by the
even permutations of b, c, d, e) are also symmetry operations of the whole
icosahedron. This behavior of the four a’s is imitated by the b’s, c’s, d’s
and e’s, so that altogether we have all the even permutations of the five letters:
the rotation group of the icosahedron (or of the dodecahedron) is isomorphic
to the alternating group A5. The 60 rotations may be counted as
follows: 4 pentagonal rotations about each of 6 axes, 2 trigonal rotations
about each of 10 axes, 1 half-turn about each of 15 axes, and the identity
[Coxeter 1, p. 50].
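
The counts of rotations for the three polyhedral groups can be cross-checked on the permutation side. The sketch below is only an illustration under the isomorphisms stated above (A4, S4, A5); the function names `cycle_type` and `census` are our own. It tallies the permutations of four or five letters by cycle type, so that, for example, the 5-cycles of A5 correspond to the pentagonal rotations and the products of two transpositions to the half-turns.

```python
from itertools import permutations
from collections import Counter

def cycle_type(images, letters):
    """Sorted tuple of the cycle periods greater than 1."""
    perm = dict(zip(letters, images))
    seen, lengths = set(), []
    for start in letters:
        length, x = 0, start
        while x not in seen:
            seen.add(x)
            x = perm[x]
            length += 1
        if length > 1:
            lengths.append(length)
    return tuple(sorted(lengths))

def census(letters, even_only):
    """Count the permutations of `letters` by cycle type."""
    counts = Counter()
    for images in permutations(letters):
        ctype = cycle_type(images, letters)
        even = sum(length % 2 == 0 for length in ctype) % 2 == 0
        if even or not even_only:
            counts[ctype] += 1
    return counts

tetrahedral = census("abcd", even_only=True)     # A4
octahedral = census("abcd", even_only=False)     # S4
icosahedral = census("abcde", even_only=True)    # A5
for name, c in [("A4", tetrahedral), ("S4", octahedral), ("A5", icosahedral)]:
    print(name, sum(c.values()), dict(c))
```

The empty cycle type `()` stands for the identity in each tally.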

We shall find that the above list exhausts the finite groups of rotations. 
As a first step in this direction, we observe that all the axes of rotation must 
pass through a fixed point. In fact, we can just as easily prove a stronger 
result: 


15.41 Every finite group of isometries leaves at least one point invariant. 


Proof. A finite group of isometries transforms any given point into a finite 
set of points, and transforms the whole set of points into itself. This, like 
any finite (or bounded) set of points, determines a unique smallest sphere that 
contains all the points on its surface or inside: unique because, if there were 
two equal smallest spheres, the points would belong to their common part, 
which is a “lens”; and the sphere that has the rim of the lens for a great 
circle is smaller than either of the two equal spheres, contradicting our sup- 
position that these spheres are as small as possible. (The shaded area in 
Figure 15.4d is a section of the lens.) The group transforms this unique 
sphere into itself. Its surface contains some of the points, and therefore all 
of them. Its center is the desired invariant point.


Figure 15.4d


It follows that any finite group of rotations may be regarded as operat- 
ing on the surface of a sphere. In such a group G, each rotation, other than 
the identity, leaves just two points invariant, namely the poles where the 
axis of rotation intersects the sphere. A pole P is said to be p-gonal (p ≥ 2)
if it belongs to a rotation of period p. The p rotations about P, through various
multiples of the angle 2π/p, are those rotations of G which leave P invariant.
Any other rotation of G transforms P into an “equivalent” pole, which
is likewise p-gonal. Thus all the poles fall into sets of equivalent poles. All 
the poles in a set have the same period p, but two poles of the same period 
do not necessarily belong to the same set; they belong to the same set only 
if one is transformed into the other by a rotation that belongs to G. 

Any set of equivalent p-gonal poles consists of exactly n/p poles, where n 
is the order of G. To prove this, take a point Q on the sphere, arbitrarily 
near to a pole P belonging to the set. The p rotations about P transform 
Q into a small p-gon round P. The other rotations of G transform this p- 
gon into congruent p-gons round all the other poles in the set. But the n 
rotations of G transform Q into just n points (including Q itself). Since 
these n points are distributed into p-gons round the poles, the number of
poles in the set must be n/p.
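
As a concrete check of the count n/p, one may take the rotation group of the cube. The sketch below assumes the Euclidean model in which these 24 rotations are the signed permutation matrices of determinant +1; this model, the helper names, and the choice of sample poles (a face center, a vertex and an edge midpoint, given as direction vectors rather than unit vectors) are our assumptions, not part of the text. For each pole it counts the rotations fixing it (the period p) and the poles equivalent to it.

```python
from itertools import permutations, product

def det3(m):
    """Determinant of a 3x3 matrix given as a tuple of rows."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def cube_rotations():
    """The 24 signed permutation matrices of determinant +1."""
    mats = []
    for perm in permutations(range(3)):
        for signs in product((1, -1), repeat=3):
            m = tuple(
                tuple(signs[r] if c == perm[r] else 0 for c in range(3))
                for r in range(3)
            )
            if det3(m) == 1:
                mats.append(m)
    return mats

def act(m, v):
    """Image of the vector v under the matrix m."""
    return tuple(sum(m[r][c] * v[c] for c in range(3)) for r in range(3))

G = cube_rotations()
n = len(G)
report = {}
for pole in [(1, 0, 0), (1, 1, 1), (1, 1, 0)]:
    p = sum(1 for m in G if act(m, pole) == pole)   # rotations leaving the pole invariant
    orbit = {act(m, pole) for m in G}               # the set of equivalent poles
    report[pole] = (p, len(orbit))
    print(pole, "period", p, "equivalent poles", len(orbit), "n/p", n // p)
```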

The n − 1 rotations of G, other than the identity, consist of p − 1 for
each p-gonal axis, that is, ½(p − 1) for each p-gonal pole, or

$$\tfrac{1}{2}(p - 1)\,\frac{n}{p}$$

for each set of n/p equivalent poles. Hence

$$n - 1 = \tfrac{1}{2}\, n \sum \frac{p - 1}{p},$$

where the summation is over the sets of poles. This equation may be expressed as

$$2 - \frac{2}{n} = \sum \left(1 - \frac{1}{p}\right).$$

If n = 1, so that G consists of the identity alone, there are no poles, and the
sum on the right has no term. In all other cases n ≥ 2, and therefore

$$1 \le 2 - \frac{2}{n} < 2.$$

It follows that the number of sets of poles can only be 2 or 3; for, the single
term 1 − 1/p would be less than 1, and the sum of 4 or more terms would be

$$\ge 4\left(1 - \tfrac{1}{2}\right) = 2.$$


If there are 2 sets of poles, we have

$$2 - \frac{2}{n} = \left(1 - \frac{1}{p_1}\right) + \left(1 - \frac{1}{p_2}\right),$$

that is,

$$\frac{n}{p_1} + \frac{n}{p_2} = 2.$$

But two positive integers can have the sum 2 only if each equals 1; thus

$$p_1 = p_2 = n,$$

each of the 2 sets of poles consists of one n-gonal pole, and we have the
cyclic group C_n, with a pole at each end of its single axis.
Finally, in the case of 3 sets of poles we have

$$2 - \frac{2}{n} = 1 - \frac{1}{p_1} + 1 - \frac{1}{p_2} + 1 - \frac{1}{p_3},$$

whence

$$\frac{1}{p_1} + \frac{1}{p_2} + \frac{1}{p_3} = 1 + \frac{2}{n}. \tag{15.42}$$


Since this is greater than ⅓ + ⅓ + ⅓ = 1, the three periods p_i cannot all be
3 or more. Hence at least one of them is 2, say p_3 = 2, and we have

$$\frac{1}{p_1} + \frac{1}{p_2} = \frac{1}{2} + \frac{2}{n},$$

whence

$$(p_1 - 2)(p_2 - 2) = 4\left(1 - \frac{p_1 p_2}{n}\right) < 4$$

(cf. 10.33), so that the only possibilities (with p_1 ≤ p_2 for convenience) are:

$$p_1 = 2,\ p_2 = p,\ n = 2p; \qquad p_1 = 3,\ p_2 = 3,\ n = 12;$$
$$p_1 = 3,\ p_2 = 4,\ n = 24; \qquad p_1 = 3,\ p_2 = 5,\ n = 60.$$


We recognize these as the dihedral, tetrahedral, octahedral and icosahedral 
groups. 
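
The case analysis for three sets of poles can be confirmed by a brute-force search. The sketch below is illustrative only: it lists the periods in increasing order (so the period 2 comes first rather than last), and the function name `pole_solutions` and the search bound `max_period` are our own choices. It finds every triple of periods whose reciprocals sum to more than 1 and solves 15.42 for n in exact rational arithmetic.

```python
from fractions import Fraction

def pole_solutions(max_period):
    """All (a, b, c) with 2 <= a <= b <= c <= max_period and 1/a + 1/b + 1/c > 1,
    together with the order n determined by 1/a + 1/b + 1/c = 1 + 2/n."""
    found = []
    for a in range(2, max_period + 1):
        for b in range(a, max_period + 1):
            for c in range(b, max_period + 1):
                s = Fraction(1, a) + Fraction(1, b) + Fraction(1, c)
                if s > 1:
                    found.append(((a, b, c), 2 / (s - 1)))
    return found

solutions = pole_solutions(max_period=7)
for periods, n in solutions:
    kind = "dihedral" if periods[:2] == (2, 2) else "polyhedral"
    print(periods, "n =", n, kind)
```

Raising `max_period` only lengthens the dihedral family (2, 2, p) with n = 2p; the three polyhedral solutions do not change.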
This completes our proof [Klein 3, p. 129] that 


15.43 The only finite groups of rotations in three dimensions are the cyclic
groups C_p (p = 1, 2, ...), the dihedral groups D_p (p = 2, 3, ...), the tetrahedral
group A4, the octahedral group S4, and the icosahedral group A5.

(To avoid repetition, we have excluded D1 which, when considered as a
group of rotations, is not only abstractly but geometrically identical with
C2.)

Any solid having one of these groups for its complete symmetry group
(such as the Archimedean snub cube* shown in Figure 15.4e, whose group
is S4) can occur in two enantiomorphous varieties, dextro and laevo (i.e.,
right- and left-handed): mirror images that cannot be superposed by a continuous
motion.


Figure 15.4e


EXERCISES 


1. Interpret the following permutations as rotations of the octahedron (Figure
15.4b):


(a b c d), (a b c), (a b), (a b)(c d).

Count the rotations of each type, and check with the known order of S4.

2. Using the symbol (p_1, p_2, p_3) for the group having three sets of poles of periods
p_1, p_2, p_3, consider the possibility of stretching the notation so as to allow (1, p, p) ≅ C_p
as well as


(2, 2, p) ≅ D_p, (2, 3, 3) ≅ A4,
(2, 3, 4) ≅ S4, (2, 3, 5) ≅ A5.


15.5 FINITE GROUPS OF ISOMETRIES 


Having enumerated the finite groups of rotations, we can easily solve the 
wider problem of enumerating the finite groups of isometries (cf. § 2.7). 
Since every such group leaves one point invariant, we are concerned only 
with isometries having fixed points. Such an isometry is a rotation or a ro- 
tatory inversion according as it is direct or opposite (7.15, 7.41). 

If a finite group of isometries consists entirely of rotations, it is one of
the groups G considered in § 15.4. If not, it contains such a group G as a
subgroup of index 2, that is, it is a group of order 2n consisting of n rotations
S_1, S_2, ..., S_n and an equal number of rotatory inversions T_1,
T_2, ..., T_n. For, if the group consists of n rotations S_i and (say) m rotatory
inversions T_i, we can multiply by T_1 so as to express the same n + m
isometries as S_iT_1 and T_iT_1. The n isometries S_iT_1, being rotatory inversions,
are the same as T_i (suitably rearranged if necessary), and the m isometries
T_iT_1, being rotations, are the same as S_i. Therefore m = n.


* The vertices of the snub cube constitute a distribution of 24 points on a sphere for which the
smallest distance between any 2 is as great as possible. This was conjectured by K. Schütte and
B. L. van der Waerden (Mathematische Annalen, 123 (1951), pp. 108, 123) and was proved by
R. M. Robinson (ibid., 144 (1961), pp. 17-48). The analogous distribution of 6 or 12 points is
achieved by the vertices of an octahedron or an icosahedron, respectively. For 8 points the figure
is not, as we might at first expect, a cube, but a square antiprism [Fejes Tóth 1, pp. 162-164].


If the central inversion I belongs to the group, the n rotatory inversions
are simply


S_iI = IS_i (i = 1, 2, ..., n),


and the group is the direct product G × {I}, where G is the subgroup consisting
of the S’s and {I} denotes the group of order 2 generated by I. (As
an abstract group, {I} is, of course, the same as C2 or D1.)


If I does not belong, the 2n transformations S_i and T_iI form a group of
rotations of order 2n which has the same multiplication table as the given
group consisting of S_i and T_i. For, if S_iT_j = T_k,

$$S_i T_j I = T_k I,$$

and if T_iT_j = S_k,

$$T_i I\, T_j I = T_i I^2 T_j = T_i T_j = S_k.$$

In other words, a group of n rotations and n rotatory inversions, not including
I, is isomorphic to a rotation group G’ of order 2n which has a subgroup
G of order n. To complete our enumeration, we merely have to seek
such pairs of related rotation groups. Each pair yields a “mixed” group,
say G’G, consisting of all the rotations in the smaller group G, along with
the remaining rotations in G’ each multiplied by the central inversion I.
Looking back at § 15.4, we see that the possible pairs are 


C_{2n}C_n, D_nC_n, D_nD_{n/2} (n even), S4A4.
Thus we can complete Table III on p. 413.
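
The construction of a mixed group G’G can be carried out explicitly for S4A4. The sketch below assumes the Euclidean model in which the rotation group S4 of the cube consists of the signed permutation matrices of determinant +1, and in which a regular tetrahedron is inscribed in the cube with the four vertices listed; these coordinates and all helper names are our assumptions. It keeps the rotations of A4, multiplies each remaining rotation of S4 by the central inversion, and checks that the result is a group of order 24 that preserves the tetrahedron and does not contain I.

```python
from itertools import permutations, product

def det3(m):
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def matmul(a, b):
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3))
        for i in range(3)
    )

def act(m, v):
    return tuple(sum(m[r][c] * v[c] for c in range(3)) for r in range(3))

def negate(m):
    """Multiply by the central inversion I (which is minus the identity)."""
    return tuple(tuple(-x for x in row) for row in m)

# G' = S4: the rotations of the cube, as signed permutation matrices of determinant +1.
S4 = []
for perm in permutations(range(3)):
    for signs in product((1, -1), repeat=3):
        m = tuple(tuple(signs[r] if c == perm[r] else 0 for c in range(3)) for r in range(3))
        if det3(m) == 1:
            S4.append(m)

# G = A4: the rotations that preserve an inscribed tetrahedron (assumed coordinates).
tetra = {(1, 1, 1), (1, -1, -1), (-1, 1, -1), (-1, -1, 1)}
A4 = [m for m in S4 if {act(m, v) for v in tetra} == tetra]

# The mixed group S4A4: rotations of A4, and the other rotations of S4 times I.
mixed = set(A4) | {negate(m) for m in S4 if m not in A4}

central_inversion = ((-1, 0, 0), (0, -1, 0), (0, 0, -1))
closed = all(matmul(a, b) in mixed for a in mixed for b in mixed)
preserves = all({act(m, v) for v in tetra} == tetra for m in mixed)
print(len(A4), len(mixed), closed, preserves, central_inversion in mixed)
```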


EXERCISES 


1. Determine the symmetry groups of the following figures: (a) an orthoscheme
O_0O_1O_2O_3 (Figure 10.4c) with O_0O_1 = O_2O_3; (b) an n-gonal antiprism (n even or odd).

2. Designate in the G’G notation the direct product of the group of order 3 generated
by a rotation about a vertical axis and the group of order 2 generated by the
reflection in a horizontal plane.


15.6 GEOMETRICAL CRYSTALLOGRAPHY 


The sense in which a snail's shell winds is an inheritable character 
founded in its genetic constitution, as is... the winding of the intestinal 
duct in the species Homo sapiens. ... Also the deeper chemical consti- 
tution of our human body shows that we have a screw, a screw that is 
turning the same way in every one of us.*... A horrid manifestation of 
this genotypical asymmetry is a metabolic disease called phenylketo- 
nuria, leading to insanity, that man contracts when a small quantity of 
laevo-phenylalanine is added to his food, while the dextro- form has no 
such disastrous effects. 


H. Weyl [1, p. 30] 


The discussion of symmetry groups has been phrased in such a way as to 
be valid not only in Euclidean space but in absolute space. However, it 
seems appropriate to mention the application of these ideas to the practical 
science of crystallography. Accordingly, in this digression the geometry is 
strictly Euclidean. 

Crystallographers are interested in those finite groups of isometries which
arise as subgroups (and factor groups) of symmetry groups of three-dimensional
lattices. By § 4.5, these are the special cases in which the only rotations
that occur have periods 2, 3, 4 or 6. This crystallographic restriction
reduces the rotation groups to


C1, C2, C3, C4, C6, D2, D3, D4, D6, A4, S4,


the direct products to these eleven each multiplied by {I}, and the mixed
groups to


C2C1, C4C2, C6C3, D2C2, D3C3, D4C4, D6C6, D4D2, D6D3, S4A4.


(Of course, C1 × {I} is just {I} itself.)
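
The three lists can be regenerated by a short bookkeeping script. This is only a sketch: it treats the crystallographic restriction as the rule that every index appearing in a group symbol must lie in {1, 2, 3, 4, 6}, and it starts the dihedral families at n = 2 so that symbols coinciding with cyclic ones are not repeated. Both rules are our simplifying assumptions, chosen because they reproduce the lists above.

```python
# Periods permitted by the crystallographic restriction (1 stands for the identity).
ALLOWED = [1, 2, 3, 4, 6]

rotation = (
    [f"C{n}" for n in ALLOWED]
    + [f"D{n}" for n in ALLOWED if n >= 2]
    + ["A4", "S4"]                      # A5 is excluded: it contains rotations of period 5
)
direct = [f"{g} x {{I}}" for g in rotation]
mixed = (
    [f"C{2 * n}C{n}" for n in ALLOWED if 2 * n in ALLOWED]
    + [f"D{n}C{n}" for n in ALLOWED if n >= 2]
    + [f"D{2 * n}D{n}" for n in ALLOWED if n >= 2 and 2 * n in ALLOWED]
    + ["S4A4"]
)

print(len(rotation), rotation)
print(len(direct), direct)
print(len(mixed), mixed)
print("total:", len(rotation) + len(direct) + len(mixed))
```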

These 32 groups are called the crystallographic point groups or “crystal
classes.” Every crystal has one of them for its symmetry group, and every
group except C6C3 occurs in at least one known mineral. In the more familiar
notation of Schoenflies [see, e.g., Burckhardt 1, p. 71], the groups are
respectively


C1, C2, C3, C4, C6, D2, D3, D4, D6, T, O,
Ci, C2h, C3i, C4h, C6h, D2h, D3d, D4h, D6h, Th, Oh,
Cs, S4, C3h, C2v, C3v, C4v, C6v, D2d, D3h, Td.


To avoid possible confusion, observe that our C4C2 and S4 (“S” for “symmetric”)
are Schoenflies’s S4 and O (for “octahedral”). The 32 groups are
customarily divided into seven crystal systems, as follows:


Triclinic: C1, {I}.
Monoclinic: C2, C2 × {I}, C2C1.
Orthorhombic: D2, D2 × {I}, D2C2.
Rhombohedral: C3, C3 × {I}, D3, D3 × {I}, D3C3.
Tetragonal: C4, C4 × {I}, C4C2, D4, D4 × {I}, D4C4, D4D2.
Hexagonal: C6, C6 × {I}, C6C3, D6, D6 × {I}, D6C6, D6D3.
Cubic: A4, A4 × {I}, S4, S4 × {I}, S4A4.


Table I (on p. 413) is a complete list of the 17 discrete groups of isometries 
in two dimensions involving two independent translations. The analogous 
groups in three dimensions are the discrete groups of isometries involving 
three independent translations. The enumeration of these space groups is 
the central problem of mathematical crystallography. The complete list con- 
tains 65 + 165 = 230 groups. 

The first 65 are composed entirely of direct isometries. Although these 
were enumerated as long ago as 1869 by C. Jordan [see Hilton 1, p. 258], 
they are usually attributed to L. Sohncke who, in 1879, pointed out their ap- 
plication to crystallography. The most obvious group consists of transla- 
tions alone. The remaining 64 of the 65 contain also rotations and screw 
displacements; 22 of them occur in 11 enantiomorphous pairs which are 
mirror images of each other (one containing right-handed screw displace- 
ments and the other the reflected left-handed screw displacements). This 
explains the phenomenon of optical activity [Sayers and Eustace 1, pp. 238- 
241, 248-252]. From the standpoint of pure geometry or pure group theory, 
it would be more natural to ignore this distinction of sense, thus reducing the 
number 65 to 54, and the total of 230 to 219 [Burckhardt 1, p. 161]. 

The remaining 165 groups contain not only direct but also opposite isome- 
tries: reflections, rotatory reflections (or rotatory inversions), and glide reflec- 
tions. Their enumeration, by Fedorov in Russia (1890), Schoenflies in 
Germany (1891), and Barlow in England (1894), provides one of the most 
striking instances of independent discovery in different places using different 
methods. Fedorov, who obtained the 230 as 73 + 54 + 103 instead of 65 + 
165, was probably unaware of the preliminary work of Jordan and Sohncke. 
It is quite certain that Schoenflies knew nothing of Fedorov, and that Bar- 
low’s work was independent of both. 


EXERCISE 


Determine the symmetry groups of the following figures: (a) a rectangular parallelepiped
(e.g., a brick); (b) a rhombohedron; (c) a regular dodecahedron with an inscribed
cube (whose 8 vertices occur among the 20 vertices of the dodecahedron).


15.7 THE POLYHEDRAL KALEIDOSCOPE 


In combining three reflections... the effect is highly pleasing 


Sir David Brewster (1781-1848) 
[Brewster 1, p. 93] 


Table III (on p. 413) is a complete list of the finite groups of isometries.
In the preceding section, we selected from this list those groups which satisfy
the crystallographic restriction. Another significant way to make a selection
(partly overlapping with the previous way) is to pick out those groups which
are generated by reflections, namely,


D_nC_n (n ≥ 1), D_{2n}D_n (n odd), D_n × {I} (n even),
S4A4, S4 × {I}, A5 × {I}.


(We have now returned to absolute geometry!) 

D1C1 (Schoenflies’s Cs, which we previously denoted by C2C1) is the group
of order 2 generated by a single reflection. D2C2 or D2D1 (Schoenflies’s C2v)
is the group of order 4 generated by two orthogonal reflections. The remaining
groups D_nC_n are the symmetry groups of the n-gonal pyramids. In other
words, these are the groups D_n of § 2.7 in a different notation. (We now reserve
the symbol D_n for the dihedral group of rotations, which is, of course,
isomorphic to D_nC_n. Weyl [1, p. 80] makes the distinction by calling the rotation
group D’_n and the mixed group D’_nC_n.)

D2 × {I} is a group of order 8 (abstractly C2 × C2 × C2) generated by
three orthogonal reflections. The remaining groups D_{2n}D_n (n odd) and
D_n × {I} (n even) are the symmetry groups of the n-gonal prisms, or of their
reciprocals, the dipyramids.

S4A4, the symmetry group of the regular tetrahedron, is derived from the
rotation group A4 by adjoining reflections, such as the reflection in the plane
ABA’B’ (Figure 10.5a) which joins the edge AB to the midpoint of the opposite
edge CD. (The product of this reflection and the central inversion is
the half-turn about the join of the midpoints of the two opposite edges CD’,
C’D of the cube. This half-turn, which interchanges the two reciprocal tetrahedra
ABCD, A’B’C’D’, is one of the twelve rotations in S4 that do not belong
to the subgroup A4; thus it illustrates our special meaning for the
“mixed” symbol S4A4.) Since the remaining Platonic solids are centrally
symmetrical, their symmetry groups are simply S4 × {I} and A5 × {I}.


For a practical demonstration in Euclidean space, take the two hinged mirrors of § 2.7, inclined
at 180°/n, which demonstrate the group D_nC_n. Standing them upright on a separate horizontal
mirror, we obtain the symmetry group of the n-gonal prism, i.e., the direct product of D_nC_n and the
group of order 2 generated by the horizontal reflection. To demonstrate the three remaining
groups, remove the third mirror, and let the first two stand vertically on the table at an angle of 60°,
as in the demonstration of D3C3. Now hold the third mirror obliquely, with its horizontal edge l
on the table top at right angles to one of the vertical mirrors and touching the front lower corner of
the other. Gradually rotating this third mirror about its edge l from an almost horizontal position
(by raising its nearer edge, opposite to l), we observe at a certain stage two faces of a regular
tetrahedron {3, 3}. Each face is subdivided into six right-angled triangles, one of which is
actually the exposed portion of the table top. At a later stage we see three faces of an octahedron
{3, 4}; still later, four faces of an icosahedron {3, 5}. Finally, when the adjustable mirror is vertical
like the others, we see a theoretically infinite number of faces of the regular tessellation
{3, 6}, subdivided in the manner of Figure 4.6d. This device, employing ordinary rectangular
mirrors, is a simplified version of Möbius’s trihedral kaleidoscope in which the three mirrors are
cut in the shape of suitable sectors of a circle [Coxeter 1, p. 83].


When the E edges of the general Platonic solid {p, q} are projected from
its center onto a concentric sphere, they become E arcs of great circles, decomposing
the surface into F regions which are “spherical p-gons.” In this
manner the polyhedron yields a “spherical tessellation” which closely resembles
the plane tessellation of § 4.6. The symmetry group of {p, q} is
derived from the symmetry group of one face by adding the reflection in a
side of that face. Thus it is generated by reflections in the sides of a spherical
triangle whose angles are π/p (at the center of a face), π/2 (at the
midpoint of an edge), and π/q (at a vertex). This spherical triangle is a
fundamental region for the group, since it is transformed into neighboring
regions by the three generating reflections.
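
Since the fundamental regions fill the sphere, the order of the symmetry group is the number of such triangles. The sketch below is an illustration under two assumptions not stated in this paragraph: the sphere has unit radius in Euclidean space, so that a spherical triangle has area equal to its angle sum minus π, and the face counts F of the five solids are the familiar ones. The function name `symmetry_order` is ours. The result is compared with 2pF, the number obtained by cutting each of the F faces into 2p triangles.

```python
from fractions import Fraction

def symmetry_order(p, q):
    """Sphere area 4*pi divided by the area of the triangle with angles pi/p, pi/2, pi/q."""
    # Area / pi = 1/p + 1/2 + 1/q - 1 (angular excess over pi, in units of pi).
    excess_over_pi = Fraction(1, p) + Fraction(1, q) - Fraction(1, 2)
    return 4 / excess_over_pi

# Assumed face counts F for {p, q}.
FACES = {(3, 3): 4, (4, 3): 6, (3, 4): 8, (5, 3): 12, (3, 5): 20}

orders = {}
for (p, q), F in FACES.items():
    orders[(p, q)] = symmetry_order(p, q)
    print((p, q), "order", orders[(p, q)], "2pF", 2 * p * F)
```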


Figure 15.7a 


The network of such triangles, filling the surface of the sphere, is cut out
by all the planes of symmetry of the polyhedron, namely the planes joining
the center to the edges of both {p, q} and its reciprocal {q, p}. In Figure
15.7a (where p and q are 3 and 5), alternate regions have been blackened so
as to exhibit both the complete symmetry group A5 × {I} and the rotational
subgroup A5, which preserves the coloring.

Instead of deriving the network of spherical triangles from the regular 
polyhedron, we may conversely derive the polyhedron from the network. 
The ten triangles in the middle of Figure 15.7a evidently combine to form 
a face of the blown-up dodecahedron, and the six triangles surrounding a
point where the angles are 60° combine to form a face of the blown-up
icosahedron.


EXERCISES 


1. Interpret the symbol {p, 2} as a spherical tessellation (“dihedron”) whose faces
consist of two hemispheres, and {2, p} as another whose faces consist of p lunes.

2. How many planes of symmetry does each Platonic solid have? Provided p and
q are greater than 2, this number is always a multiple of 3, namely 3c in the notation
of Ex. 1 at the end of § 10.4.

3. Dividing 4π by the area of the fundamental region, obtain a formula for the order
of the symmetry group of {p, q}. Reconcile this with the formula for E (the number of
edges) in 10.32.


15.8 DISCRETE GROUPS GENERATED BY INVERSIONS 


In the present section we make one more digression into Euclidean space, 
so as to be able to talk about inversion. (The absolute theory of inversion 
presents difficulties that would take us too far afield. [See Sommerville 1, 
Chapter VIII.]) 

Figure 15.7a, being an orthogonal projection, represents 10 of the 15 great 
circles by ellipses. (The difficult task of drawing it was undertaken by J. F.
Petrie about 1932.) An easier, and perhaps more significant, way to represent
such figures is by stereographic projection (§ 6.9), so that the great circles
remain circles (or lines) [Burnside 1, pp. 406-407]. The reader can readily do 
this for himself, with the aid of the following simple instructions. 


Figure 15.8a 


Figure 15.8a shows a square PQRS with center O, and a regular pentagon
VWXYZ with its sides extended to form a pentagram V’X’Z’W’Y’. With
radius PQ and centers P, Q, R, S, draw four circles. These, along with two
lines through O parallel to the sides of the square, represent 6 great circles,
one in each of the 6 planes of symmetry of the tetrahedron {3, 3}, which
are the planes joining pairs of opposite edges of a cube. Adding the circumcircle
and diagonals of the square, we have altogether 9 great circles,
one in each of the 9 planes of symmetry of the cube {4, 3}, which include
3 planes parallel to its faces.

With radius VX’ (=VX) and centers V, W, X, Y, Z, draw five circles. 
With radius VW’ (= V’W’) and centers V’, W’, X’, Y’, Z’, draw five more 
circles. These ten circles, along with the five lines VV’, WW’, XX’, YY’, ZZ’, 
represent 15 great circles (Figure 15.7a), one in each of the 15 planes of sym- 
metry of the icosahedron {3, 5} or of the dodecahedron {5, 3}. (These planes 
join pairs of opposite edges of either solid.) 

To justify these statements we merely have to examine the curvilinear 
triangles* and observe that each has angles $\pi/p$, $\pi/q$, $\pi/2$. 


Since stereographic projection is an inversion (Figure 6.9a), and since an 
inversion transforms a reflection into an inversion, the figures so constructed 
are, in fact, representations of the abstract groups $S_4$, $S_4 \times C_2$, and $A_5 \times C_2$ 
as groups generated by inversions. In other words, they are configurations 
of circles so arranged that the whole figure is symmetrical by inversion in 
each circle. (Of course, any straight lines that occur are to be regarded as 
circles of infinite radius. As we saw in §6.4, inversion in such a “circle” is 
simply reflection in the line.) Any one of the regions into which the plane 
is decomposed will serve as a fundamental region, and the generators of the 
group may be taken to be the inversions in its sides. 

For a group generated by just one inversion, we may invert the circle into 
a straight line so as to obtain the group $D_1$ of order 2, generated by a single 
reflection (§ 2.5). The groups generated by inversions in two intersecting 
circles are essentially the same as the groups $D_n$ of order 2n, generated by 
reflections in two intersecting lines (§ 2.7). If the circles of two generating 
inversions are in contact, they can be inverted into parallel lines, and we 
have the limiting case $D_\infty$ (Figure 3.7b). Two nonintersecting circles can 
be inverted into concentric circles. Inversions in them generate an infinite 
sequence of concentric circles whose radii are in geometric progression. 
Abstractly, the group is again $D_\infty$, but the center is a “point of accumulation” 
(§ 7.6). So is the point of contact in the case of the group generated 
by inversions in two touching circles. A group is said to be discrete if it 
has no points of accumulation. Thus, in describing discrete groups gener- 
ated by inversions, we may insist that every two of the generating circles in- 
tersect properly, and do not touch. 
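
The geometric progression of radii can be checked numerically. The following is a minimal sketch, not part of the original argument: it assumes two concentric circles with illustrative radii 1 and 2, and it uses the standard fact that inversion in a circle of radius k sends a concentric circle of radius r to one of radius k²/r. The function names are hypothetical.

```python
def invert_radius(r, k):
    """Radius of the image of a concentric circle of radius r
    under inversion in the concentric circle of radius k."""
    return k * k / r


def orbit_radii(a, b, steps):
    """Radii obtained from two concentric circles (radii a and b) by
    repeatedly inverting the second-newest circle in the newest one.
    Every circle produced this way belongs to the invariant figure."""
    radii = [a, b]
    for _ in range(steps):
        radii.append(invert_radius(radii[-2], radii[-1]))
    return radii


radii = orbit_radii(1.0, 2.0, 5)  # illustrative starting radii (an assumption)
ratios = [radii[i + 1] / radii[i] for i in range(len(radii) - 1)]
print(radii)
print(ratios)
```

Each successive ratio equals b/a, so the circles crowd toward the common center as the sequence is continued inward, which is why that center is a point of accumulation.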

For a discrete group generated by three inversions, the fundamental region 
is a curvilinear triangle whose angles are submultiples of $\pi$: say $\pi/p_1$, $\pi/p_2$, 
$\pi/p_3$. For instance, two radii of a circle, forming an angle $\pi/p$, cut out a 
sector which may be regarded as a “triangle” with angles $\pi/p$, $\pi/2$, $\pi/2$; this 
is a fundamental region for the group $D_1 \times D_p$ of order 4p, generated by 
reflections in the radii and inversion in the circle. In this case 


*For the effect of projecting in a different direction, see Coxeter, American Mathematical 
Monthly, 45 (1938), pp. 523-525, Figs. 4 and 5. 
15.81    $\dfrac{1}{p_1} + \dfrac{1}{p_2} + \dfrac{1}{p_3} > 1,$ 
so that the angle sum of the triangle is greater than $\pi$: an obvious consequence 
of the fact that the sector is derived from a spherical triangle (see 
§ 6.9) by stereographic projection, which preserves angles. Every solution 
of the inequality 15.81 (cf. 15.42) is a triangle that can be drawn with great 
circles on a sphere. We thus obtain again the symmetry groups 


$S_4$, $S_4 \times C_2$, $A_5 \times C_2$ 


of the Platonic solids. 

When $1/p_1 + 1/p_2 + 1/p_3 = 1$, so that the angle sum is exactly $\pi$, we 
have the infinite “Euclidean” groups p6m, p4m, p31m (see Table I and 
Figure 4.6d). We could transform all the straight lines into circles by means 
of an arbitrary inversion; but then, since the pattern is infinitely extended, 
the center of inversion would be a point of accumulation. 

When $1/p_1 + 1/p_2 + 1/p_3 < 1$, so that the angle sum of the fundamental 
region is less than $\pi$, we may still take two of the three sides to be straight, but 
now their point of intersection A is outside the circle q to which the third side 
belongs, with the result that there is a circle Q orthogonal to all three 
(Figure 15.8b); the tangents from A to q are radii of Q. 

Since Q is invariant for each of the generating inversions, it is invariant 
for the whole group. The circle q decomposes the interior of Q into two 


Figure 15.8b 
unequal regions and inverts each of these regions into the other. Therefore 
the number of triangles is the same in both regions. But the larger 
region includes a replica of the smaller. Hence, by Bolzano’s definition of an 
infinite set (namely, a set that has the same power as a proper subset), the 
number of triangles is infinite; that is, the group is infinite. 
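
The three cases distinguished above depend only on whether $1/p_1 + 1/p_2 + 1/p_3$ exceeds, equals, or falls short of 1. The sketch below is an illustration under stated assumptions: the function names are hypothetical, and the order of a finite group is computed by dividing the area $4\pi$ of a unit sphere by the area $\pi(1/p_1 + 1/p_2 + 1/p_3 - 1)$ of the spherical fundamental triangle. Exact rational arithmetic avoids rounding trouble in the borderline (Euclidean) case.

```python
from fractions import Fraction


def angle_sum_over_pi(p1, p2, p3):
    """Angle sum of the triangle with angles pi/p1, pi/p2, pi/p3, in units of pi."""
    return Fraction(1, p1) + Fraction(1, p2) + Fraction(1, p3)


def classify(p1, p2, p3):
    """Name the geometry in which the fundamental triangle lives."""
    s = angle_sum_over_pi(p1, p2, p3)
    if s > 1:
        return "spherical"
    if s == 1:
        return "Euclidean"
    return "hyperbolic"


def spherical_order(p1, p2, p3):
    """Order of the finite group: area of the unit sphere (4 pi) divided by
    the area of the fundamental triangle (its angular excess)."""
    excess = angle_sum_over_pi(p1, p2, p3) - 1
    if excess <= 0:
        raise ValueError("angle sum does not exceed pi: the group is infinite")
    return 4 / excess


for triple in [(2, 2, 7), (2, 3, 3), (2, 3, 4), (2, 3, 5),
               (2, 3, 6), (2, 4, 4), (3, 3, 3), (2, 4, 6)]:
    kind = classify(*triple)
    order = spherical_order(*triple) if kind == "spherical" else "infinite"
    print(triple, kind, order)
```

For the triple (2, 2, p) the computed order is 4p, in agreement with the group $D_1 \times D_p$ described earlier; the triple (2, 4, 6) is the hyperbolic case drawn in Figure 15.8c.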


Figure 15.8c 


The case when $p_1$, $p_2$, $p_3$ are 6, 4, 2 is shown in Figure 15.8c. Unlike 
Figure 15.7a, this is not a picture of a solid object. Our familiarity with three- 
dimensional space enables us to accept the idea that the triangles in Figure 
15.7a are all the same size, even though the peripheral ones are made to look 
smaller by perspective foreshortening. In the case of Figure 15.8c, the 
smaller peripheral triangles are essentially the same shape as those in the 
middle (since they have the same angles), but we no longer find it easy to 
imagine that they are, in some sense, the same size. In trying to stretch our 
imagination to this extent, we are taking a first step towards appreciating 
hyperbolic geometry, which is the subject of our next chapter. 

The reader may wonder why we admit such groups as being worthy of con- 
sideration, seeing that the circle Q contains infinitely many points of accumu- 
lation. However, when we accept the non-Euclidean standpoint, so that the 
circles and inversions are regarded as lines and reflections, the consequent 
distortion of distance makes Q infinitely far away, so that the points of 
accumulation disappear. 
EXERCISES 


1. If a system of concentric circles is transformed into itself by inversion in each 
circle, the radii are in geometric progression. 

2. If three circles form a “triangle” with angles $\pi/p_1$, $\pi/p_2$, $\pi/p_3$, the inversions $R_1$, 
$R_2$, $R_3$ in its sides satisfy the relations 


$R_1^2 = R_2^2 = R_3^2 = (R_2R_3)^{p_1} = (R_3R_1)^{p_2} = (R_1R_2)^{p_3} = 1.$ 


These relations suffice to define the abstract group generated by $R_1$, $R_2$, $R_3$ [Coxeter 
and Moser 1, pp. 37, 55]. 

3. Given an angle $\pi/p_1$ at the center A of a circle Q of unit radius, as in Figure 15.8b, 
find expressions (in terms of $p_1$ and $p_2$) for the radius of the circle q and for the distance 
from A to its center, in the case when $p_3 = 2$. 


4. Invert Figure 15.8c in a circle whose center lies on Q; that is, replace the circle Q 
by a straight line, so that all the inverting circles have their centers on this line. (Such 
an arrangement provides an alternative proof that the group is infinite. For if its order 
is g, the infinite half plane is filled with g curvilinear triangles, each having a finite area!) 

5. In Figure 15.8c, two of the small triangles (one white and one black) with a com- 
mon hypotenuse form together a “curvilinear kite” having three right angles and one 
angle of 60°. Trace part of the figure so as to exhibit a network of such kites, alter- 
nately white and black. We now have an instance of a group generated by four in- 
versions. Can it happen that more than four inversions are needed to generate a 
discrete group? 


16 


Hyperbolic geometry 


Absolute geometry is not categorical: it is two geometries in one. To be 
precise, it leaves open the question of the existence of ultraparallel lines (see 
the end of § 15.2). In § 16.1 we shall compare the two possible answers, giv- 
ing the unfamiliar the same status as the familiar. In § 16.2 we shall justify 
this action by means of a proof of relative consistency. Thereafter, casting 
aside all scruples, we shall plunge wholeheartedly into the “new universe” 
which Bolyai “created from nothing.” 


16.1 THE EUCLIDEAN AND HYPERBOLIC AXIOMS OF PARALLELISM 


In the author there lives the perfectly purified conviction (such as he 
expects too from every thoughtful reader) that by the elucidation of 
this subject one of the most important and brilliant contributions has 
been made to the real victory of knowledge, to the education of the 
intelligence, and consequently to the uplifting of the fortunes of men. 


J. Bolyai (1802-1860) 
[Carslaw 1, p. 31] 


In § 12.6, we mentioned the question whether the two rays parallel to a 
given line r from an outside point A are, or are not, collinear. By applying 
a suitable isometry, we see that the answer is independent of the position 
of r. 

It is true, though less obvious, that, for a given r, the answer is independent 
of the position of A. Suppose, if possible, that the rays parallel to r from A 
are the two halves of a line q while the rays parallel to r from another point A’ 
form an angle, as in Figure 16.1a. By the transitivity of parallelism, these 
rays from A’ are parallel to q and also to the infinite sequence of parallel lines 
derived from q and r by applying the group $D_\infty$ generated by reflections in q 
and r (Figure 3.7b). We obtain a manifest absurdity by considering any one 
of these lines that lies beyond A’ (i.e., in such a position that A’ lies between 
that line and r). (Strictly, this argument makes use of the so-called Axiom 
of Archimedes, 13.31, which is a consequence of 12.51.) 


Figure 16.1a 


Thus we have a clear-cut distinction between two kinds of geometry, called 
Euclidean and hyperbolic, which are derived from absolute geometry by add- 
ing just one of the following two alternative axioms: 


THE EUCLIDEAN AXIOM. For some point A and some line r, not through A, 
there is not more than one line through A, in the plane Ar, not meeting r. 


THE HYPERBOLIC AXIOM. For some point A and some line r, not through A, 
there is more than one line through A, in the plane Ar, not meeting r. 


EXERCISE 


Each of these axioms implies the stronger statement with “some point A and some 
line r” replaced by “any point A and any line r.” The Euclidean axiom, so amended, 
is equivalent to the celebrated Postulate V (our 1.25). How does Postulate V break 
down if we assume the hyperbolic axiom? 


16.2 THE QUESTION OF CONSISTENCY 


What are we to think of the question: Is Euclidean Geometry true? It 
has no meaning. We might as well ask . . . if Cartesian coordinates are 
true and polar coordinates false. One geometry cannot be more true 
than another; it can only be more convenient. 


H. Poincaré (1854-1912) 
(Science and Hypothesis, New York, 1952) 


We observe that the Euclidean and hyperbolic axioms differ by just one 
word: the vital word “not.” It is meaningless to ask which of the two 
geometries is true, and practically impossible to decide which provides a more 
convenient basis for describing astronomical space. From the standpoint of 
pure mathematics, a more important question is whether either axiom is logi- 
cally consistent with the remaining axioms of absolute geometry. Even this 
is difficult to answer; for according to the philosopher Gödel, there is no 
internal proof of consistency for a system that includes infinite sets. We have 
to be content with relative consistency: if Euclidean geometry is free from 
contradiction, so is hyperbolic geometry, and vice versa. Relative consist- 
ency is established by finding in each geometry a model of the other. 

One Euclidean model of the hyperbolic plane (due to Poincaré) was mentioned 
in § 15.8. This uses a circle Q, as in Figure 16.2a. Each pair of inverse 
points represents a hyperbolic point, and each circle orthogonal to Q represents 
a hyperbolic line. The two parallels to r from A are simply the circles 
through A that touch r at its points of intersection with Q. (These points are 
the “ends” of r.) We call this a conformal model because angles retain their 
proper values though distances are inevitably distorted. 


Figure 16.2a        Figure 16.2b 


A different Euclidean model, suggested by Beltrami (1835-1900), uses another 
circle w, as in Figure 16.2b. Each point inside w represents a hyperbolic 
point. The two parallels to r from A are the chords joining A to the ends 
of the chord r. (Chords whose lines intersect outside w represent ultraparallel 
lines.) We call this a projective model because straight lines remain 
straight. Nothing is lost if we replace the circle w in the Euclidean plane by a 
conic in the projective plane. In fact, much is gained; for it is possible to ex- 
tend the hyperbolic plane into a projective plane by means of entities defined 
in the hyperbolic geometry itself [Coxeter 3, p. 196]. In this way we can 
prove that hyperbolic geometry is unique or categorical [Borsuk and Szmie- 
lew 1, p. 345], unlike absolute geometry, which includes two contrasting 
possibilities. 

When using models, it is desirable to have two rather than one, so as to 
avoid the temptation to give either of them undue prominence. Our geo- 
metric reasoning should all depend on the axioms. The models, having 
served their purpose of establishing relative consistency [Pedoe 1, p. 61; 
Sommerville 1, pp. 154-159], are no more essential than diagrams. 

Klein [4, p. 296] exhibited a connection between the conformal and projective 
models in the manner of Figure 16.2c. A sphere, having the same 
radius as w, touches the (horizontal) plane at S, the center of both w and Q. 
Figure 16.2c 


Beginning with the projective model, we use orthogonal (vertical) projection 
to map w on the “equator” $\eta$ of the sphere, and each interior point on two 
points: one in the southern hemisphere and another (not shown) in the northern 
hemisphere. Every chord of w yields a circle in a vertical plane, that is, 
a circle orthogonal to $\eta$. We now map the sphere back into the plane by 
stereographic projection, so that $\eta$ projects into the larger circle Q, concentric 
with w. Because of the angle-preserving and circle-preserving nature 
of stereographic projection, the vertical circles yield horizontal circles orthogonal 
to Q, and we have the conformal model. 
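
Klein's two-step passage from the projective model to the conformal one can be sketched in coordinates. The set-up below is an assumption made purely for illustration: w is taken to be the unit circle centered at S = (0, 0), the sphere has unit radius and center (0, 0, 1), and the north pole is N = (0, 0, 2), so that Q acquires radius 2. The function name is hypothetical.

```python
import math


def projective_to_conformal(x, y):
    """Map a point (x, y) inside the unit circle w to the conformal model.

    Step 1 (vertical projection): lift (x, y) to the southern hemisphere of
    the unit sphere centered at (0, 0, 1), giving height z = 1 - s, where
    s = sqrt(1 - x^2 - y^2).
    Step 2 (stereographic projection from N = (0, 0, 2) onto z = 0):
    scale the horizontal coordinates by 2 / (2 - z) = 2 / (1 + s).
    """
    r2 = x * x + y * y
    if r2 >= 1.0:
        raise ValueError("the point must lie inside w")
    s = math.sqrt(1.0 - r2)
    k = 2.0 / (1.0 + s)
    return (k * x, k * y)


# The chord x = c of w should go to an arc of the circle orthogonal to Q
# (radius 2) with center (2/c, 0) and radius 2*sqrt(1 - c^2)/c.
c = 0.6
center_x = 2.0 / c
radius = 2.0 * math.sqrt(1.0 - c * c) / c
for y in (-0.7, -0.3, 0.0, 0.5):
    u, v = projective_to_conformal(c, y)
    print(round(math.hypot(u - center_x, v), 9), round(radius, 9))
```

Near S the scale factor 2/(1 + s) is close to 1, which reflects the remark that this choice of projection makes the two models agree in the immediate vicinity of S.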

Instead of stereographic projection onto the tangent plane at the “south 
pole” S (i.e., inversion with respect to a sphere of radius NS), we could have 
used stereographic projection (from the same “north pole” N) onto the equatorial 
plane (i.e., inversion with respect to a sphere through $\eta$) so as to make 
both w and Q coincide with $\eta$ [Coxeter 3, p. 260]. Klein’s procedure is justified 
by its property of making the two models agree in the immediate vicinity 
of S. This must have seemed to him more important than making them 
agree “at infinity.” 

It must be remembered that both models are in one respect misleading: 
they give us the impression that the center S should play a special role, 
whereas, in the abstract hyperbolic plane, all points are alike. 

For the sake of completeness, we should mention the problem that the in- 
habitants of a hyperbolic world would face in trying to visualize the Euclid- 
ean plane. One solution [Coxeter 3, pp. 197-198] is that they could repre- 
sent the Euclidean points and lines by the lines and planes parallel to a given 
ray in hyperbolic space! 


EXERCISES 


1. Reflection in a line of the hyperbolic plane appears, in the conformal model, as in- 
version with respect to a circle, and in the projective model as a harmonic homology. 
What is the corresponding transformation in the space of Klein’s sphere? 
2. Circles appear as circles (not meeting Q) in the conformal model, and therefore 
as circles on the sphere (say, in the southern hemisphere) and as ellipses in the pro- 
jective model. 


16.3 THE ANGLE OF PARALLELISM 


. . . a sea-change into something rich and strange. 


W. Shakespeare (1564-1616) 
(The Tempest, Act I, Scene 2) 


Figure 16.3a 


For the rest of this chapter the geometry will be hyperbolic, that is, 
we shall assume the hyperbolic axiom, which implies that, for any point A 
and line r, not through A, the two parallels form an angle NAM, as in Fig- 
ure 16.3a. From A draw AB perpendicular to r. Reflection in AB shows 
that $\angle BAM$ and $\angle NAB$ are equal acute angles. Following Lobachevsky, 
we call either of them the angle of parallelism corresponding to the distance 
AB, and write 


$\angle BAM = \Pi(AB).$ 


Before we can prove that this function is monotonic, we need a few more 
properties of asymptotic triangles. While proving 15.26 we discovered that, 
if a transversal (AD in Figure 15.2f) meets two lines in such a way that the 
“alternate” angles are equal, then the two lines are ultraparallel. Hence 
[Carslaw 1, p. 48]: 


16.31 In an asymptotic triangle EFM, the external angle at E (or F) is 
greater than the internal angle at F (or E). 


In other words, the sum of the angles of an asymptotic triangle is less 
than $\pi$. This will enable us to prove a kind of converse for Theorem 15.25, 
to the effect that an asymptotic triangle is determined by its two positive 
angles: 


16.32 If two asymptotic triangles AEM, A’E’M’ have A = A’ and E = E’, 
then AE = A’E’. 
Proof [Carslaw 1, p. 50]. If AE and A’E’ are not equal, one of them must 
Figure 16.3b 


be the greater; let it be A’E’, as in Figure 16.3b. On E/A, take F so that AF 
= A’E’, and draw FM parallel to AM. By 16.31 and 15.25, we have 


$\angle MEA > \angle MFA = \angle M'E'A' = \angle MEA,$ 


which is absurd. 

These results will enable us to establish the existence of a common parallel 
to two given rays forming an angle NOM, that is, a line MN which is parallel 
to OM at one end and to ON at the other. From the given rays OM, ON, 
cut off any two equal segments OA, OA’, as in Figure 16.3c. Draw A’M 
parallel to OM, and AN parallel to ON. Bisect the angles NAM and NA’M 
by lines a and a’. We shall prove that these lines are ultraparallel, and that 
the desired common parallel MN is perpendicular to both of them. 


Figure 16.3c 


Let A’M meet AN in C, and a in E. Since the whole figure is symmetrical 
by reflection in OC, the two angles at A and the two angles at A’ are all equal. 

If possible, let a and a’ have a common point L, which is, of course, equi- 
distant from A and A’. Applying 15.25 to the congruent asymptotic triangles 
ALM and A’LM, we deduce that $\angle MLA = \angle MLA'$, which is absurd. 

If possible, let a and a’ be parallel, with a common end L. Applying 16.32 
to the congruent asymptotic triangles AEM and A’EL, we deduce that AE = 
A’E, whence E coincides with C, which is absurd. 

We conclude that a and a’ are ultraparallel. By 15.26, they have a common 
perpendicular FF’. Applying 15.25 to the congruent asymptotic triangles 
AFM and A’F’M, we conclude that 


$\angle MFA = \angle MF'A'.$ 


If FF’ were not parallel to OM, we would have an asymptotic triangle FF’M 
whose angle sum is $\pi$, contradicting 16.31. Hence, in fact, F’F is parallel to 
OM, and similarly FF’ to ON; that is, the line FF’ is a common parallel to 
the two rays as desired. 

Moreover, this common parallel is unique, since two such would be paral- 
lel to each other at both ends, contradicting the “clear-cut distinction” 
between the Euclidean and hyperbolic properties of parallelism (Figure 
16.1a). It follows that 


16.33 Any two ultraparallel lines have a unique common perpendicular. 


For, given a and a’, we can reconstruct Figure 16.3c as follows: draw any 
common perpendicular FF’, take O on its perpendicular bisector, and let the 
two parallels through O to the line FF’ meet a in A, a’ in A’. 


For the sake of brevity, we have been content to assert the existence of a 
line through a given point parallel to a given ray, and of a common perpen- 
dicular to two given ultraparallel lines. Actual “ruler and compasses” con- 
structions for these lines have been given by Bolyai and Hilbert, respectively 
[see Coxeter 3, pp. 204, 191]. Hilbert apparently failed to notice that his 
construction for the common parallel to AM and A’N remains valid if these 
lines meet in a point that is not equidistant from A and A’, or even if they do 
not meet at all. In fact [Carslaw 1, p. 76], 


16.34 Any two nonparallel rays have a unique common parallel. 


This result justifies our use of ends as if they were ordinary points: any 
two ends, M and N, determine a unique line MN. 


Figure 16.3d 


The line through A parallel to BM (Figure 16.3a or d) determines the 
angle of parallelism $\Pi(AB)$. Conversely, we can now find a distance x 
whose angle of parallelism $\Pi(x)$ is equal to any given acute angle [Carslaw 
1, p. 77]. In other words, given an acute angle CAM, we can find a line 
BM which is both perpendicular to AC and parallel to AM. We merely have 
to reflect AM in AC, obtaining AN, and then draw the common parallel 
MN, which meets AC in the desired point B. Incidentally, since we can 
draw through any point a ray parallel to a given ray, it follows that 


16.35 For any two nonperpendicular lines we can find a line which is per- 
pendicular to one and parallel to the other. 


If A’ is on the ray A/B, so that A’B > AB (as in Figure 16.3d), then 
$\Pi(A'B) < \Pi(AB).$ 


(This is simply 16.31, applied to the asymptotic triangle AA’M.) It follows 
that the function $\Pi(x)$ decreases steadily from $\frac{1}{2}\pi$ to 0 when x increases from 
0 to $\infty$. 
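
The monotonic behavior of the angle of parallelism can be illustrated numerically. The sketch below does not use anything proved so far: it assumes Lobachevsky's classical formula $\tan \frac{1}{2}\Pi(x) = e^{-x}$, which holds when distances are measured in the natural unit, and the function names are hypothetical.

```python
import math


def angle_of_parallelism(x):
    """Pi(x) in radians, assuming tan(Pi(x)/2) = exp(-x) (natural unit of length)."""
    return 2.0 * math.atan(math.exp(-x))


def distance_for_angle(theta):
    """Inverse function: the distance x whose angle of parallelism is theta."""
    if not 0.0 < theta < math.pi / 2:
        raise ValueError("theta must be an acute angle")
    return -math.log(math.tan(theta / 2.0))


for x in (0.0, 0.5, 1.0, 2.0, 5.0):
    print(x, math.degrees(angle_of_parallelism(x)))

# The distance whose angle of parallelism is 60 degrees.
print(distance_for_angle(math.pi / 3))
```

Under this assumption the printed angles fall from 90° toward 0° as x grows, and every acute angle is attained exactly once.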

We naturally call AMN a doubly asymptotic triangle [Coxeter 3, p. 188]. 
We have seen that such a “triangle” is determined by its one positive an- 
gle; in other words, 

16.36 Two doubly asymptotic triangles are congruent if they have equal 
angles. 


Applying 16.34 to rays belonging to two parallel lines LM, LN, we ob- 
tain a third line parallel to both, forming a trebly asymptotic triangle LMN. 
In view of Bolyai’s remark 15.24, we may regard such a triangle as a doubly 
asymptotic triangle whose angle is zero. Accordingly, we shall not be sur- 
prised to find that 


16.37 Any two trebly asymptotic triangles are congruent. 

Proof (due to D. W. Crowe). Given any two trebly asymptotic triangles, 
dissect each into two right-angled doubly asymptotic triangles by drawing 
an altitude (perpendicular to one side and parallel to another, as in 16.35). 
By 16.36, all the four doubly asymptotic triangles are congruent. There- 
fore the two trebly asymptotic triangles must be congruent. 


EXERCISES 


1. Draw figures for Theorems 16.33-16.35 in terms of the conformal and projective 
models. 

2. If a quadrangle ABED has right angles at D and E while AD = BE, then the 
angles at A and B are equal acute angles. (Hint: Draw AM and BM parallel to D/E; 
apply 16.31 to the asymptotic triangle ABM.) 

3. The sum of the angles of any triangle is less than two right angles. (Hint: For 
a given triangle ABC, draw AD, BE, CF perpendicular to the line joining the midpoints 
of BC and CA.) 

4. Given an asymptotic triangle ABM with acute angles at both A and B, draw 
AD perpendicular to BM, and BE perpendicular to AM, meeting in H. Draw HF 
perpendicular to AB. Then FH is parallel to AM [Bonola 1, p. 106]. What happens 
if we deal similarly with rays through A and B which are not parallel but ultra- 
parallel? 

5. If two trebly asymptotic triangles have a common side, by what isometry are 
they related? (Of course, two trebly asymptotic triangles may have a common side 
without having a common altitude.) 

6. The inradius of a trebly asymptotic triangle is the distance whose angle of paral- 
lelism is 60°. 

7. From any point on a side of a trebly asymptotic triangle, lines drawn perpen- 
dicular to the other two sides are themselves perpendicular [Bachmann 1, p. 222]. 


16.4 THE FINITENESS OF TRIANGLES 


I could be bounded in a nutshell and 
count myself a king of infinite space. 


W. Shakespeare 
(Hamlet, Act II, Scene 2) 


One of the most elegant passages in the literature on hyperbolic geom- 
etry since the time of Lobachevsky is the proof by Liebmann [1, p. 43] that 
the area of a triangle remains finite when all its sides are infinite. C. L. 
Dodgson (alias Lewis Carroll) could not bring himself to accept this the- 
orem; consequently he believed non-Euclidean geometry to be nonsense. 

Instead of pursuing a philosophical discussion of the meaning of area 
[Carslaw 1, pp. 84-90], let us be content to regard it as a numerical func- 
tion, defined for every simple closed polygon, invariant under isometries, 
and additive when two polygons are juxtaposed. 

Let ABM be any asymptotic triangle. Reflect it in the bisector AF of 
the angle A to obtain $AA_1N$, as in Figure 16.4a, F being the point where 
the bisector meets the common parallel MN. Reflect the line BM in the 
bisector $A_1F_1$ of $\angle NA_1M$ to obtain $A_2N$ (with $A_2$ on AM), and then reflect 


Figure 16.4a 
this in AF. Continuing in this manner, we construct a network of triangles 
whose “vertical” sides bisect the angles at B, A, $A_1$, $A_2$, $A_3$, . . . and are 
perpendicular to MN at G, F, $F_1$, $F_2$, $F_3$, . . . . These points are evenly spaced 
along MN, since they are all derived from F and $F_1$ by the group $D_\infty$ generated 
by reflections in AF and $A_1F_1$; for instance, G is the image of $F_1$ in 
the mirror AF. The numbered triangles which fit together to fill the asymptotic 
triangle ABM are respectively congruent to those which fit together 
within the finite pentagon $ABGF_1A_1$; in fact, any two triangles that are numbered 
alike are related by some power of the translation from G to $F_1$ (or 
from F to $F_2$). Hence the area of the asymptotic triangle is less than or 
equal to the area of the pentagon: 


16.41 Any asymptotic triangle has a finite area. 


Since any doubly asymptotic triangle (Figure 16.3a) can be dissected into 
two asymptotic triangles, it follows that 


16.42 Any doubly asymptotic triangle has a finite area. 


By 16.36, the area of a doubly asymptotic triangle is a function of its angle. 
Comparing the triangles AMN and A’MN of Figure 16.3d, we see that 
this is a decreasing function: the larger triangle has the smaller angle. 

Since any trebly asymptotic triangle can be dissected into two doubly 
asymptotic triangles (as in the proof of 16.37), 16.42 implies 


16.43 Any trebly asymptotic triangle has a finite area. 


By 16.37, this area is a constant, depending only on our chosen unit of 
measurement. 


16.5 AREA AND ANGULAR DEFECT 


Gauss . . . did not recognize the existence of a logically sound non- 
Euclidean geometry by intuition or by a flash of genius: . . . on the 
contrary, he had spent upon this subject many laborious hours before 
he had overcome the inherited prejudice against it. [He] did not let 
any rumour of his opinions get abroad, being certain that he would 
be misunderstood. Only to a few trusted friends did he reveal something 
of his work. 


R. Bonola [1, pp. 66-67] 


Janos Bolyai, or Bolyai Janos (as it is written in Hungarian), announced 
his discovery of absolute geometry in an appendix to a book by his father, 
Bolyai Farkas, who was a friend of Gauss. When Gauss saw this book and 
read the appendix, he wrote a remarkable letter to his old friend, congratu- 
lating Janos and admitting that he himself had thought along the same lines 
without publishing the results. The original letter (of March 6, 1832) is lost, 
but the younger Bolyai’s copy of it has been preserved, and it was eventually 
published in Gauss’s collected works [Gauss 1, vol. 8, pp. 220-225]. 

This letter contains a wonderful proof that the area of a triangle ABC is 
proportional to its angular defect 


$\pi - A - B - C$: 


the amount by which its angle sum falls short of two right angles. The fol- 
lowing paraphrase fills up a few gaps in the argument, while retaining 
Gauss’s systematic division into seven steps, numbered with Roman nu- 
merals. 


I. All trebly asymptotic triangles are congruent. (This is our 16.37.) 


II. The area of a trebly asymptotic triangle has a finite value, say t. (This 
is our 16.43.) 


III. The area of a doubly asymptotic triangle AMN is a function of its angle 
NAM, say $f(\phi)$, where $\phi$ is the supplement of this angle. Given the angle 
$\phi$, we can construct the triangle in a unique fashion (Figure 16.5a; cf. 16.3c). 
Gauss used the supplement, rather than the angle NAM itself, to ensure that 
$f(\phi)$ is an increasing function of $\phi$. (See the remark after 16.42.) 


Figure 16.5a        Figure 16.5b 


IV. $f(\phi) + f(\pi - \phi) = t.$ 

This may be seen by fitting together two doubly asymptotic triangles AMN 
and ANL with supplementary angles, as in Figure 16.5b. Here it is understood 
that $0 < \phi < \pi$. But when $\phi$ approaches zero, the doubly asymptotic 
triangle collapses, and when $\phi$ approaches $\pi$ it tends to become trebly asymptotic. 
Hence 


16.51    $f(0) = 0, \quad f(\pi) = t,$ 

and IV is valid for $0 \le \phi \le \pi$. 


V. $f(\phi) + f(\psi) + f(\pi - \phi - \psi) = t.$ 

This, with $\phi > 0$, $\psi > 0$, $\phi + \psi < \pi$, may be seen by fitting together three 
doubly asymptotic triangles whose angles add up to $2\pi$, as in Figure 16.5c. 
It evidently remains valid when $\phi$ or $\psi$ is zero or $\phi + \psi = \pi$. 
VI. $f(\phi) + f(\psi) = f(\phi + \psi).$ 

This, with $\phi \ge 0$, $\psi \ge 0$, $\phi + \psi \le \pi$, is obtained algebraically, by writing 
$\phi + \psi$ instead of $\phi$ in IV and then using V. It follows that $f(\phi)$ is simply a 
multiple of $\phi$, namely, 


16.52    $f(\phi) = \mu\phi,$ 


where, by 16.51, $\mu = t/\pi$. 
J. H. Lindsay has pointed out that this deduction can be made without 
assuming the function to be continuous. By VI, with $\phi = \psi$, 


$f(\phi) = \tfrac{1}{2} f(2\phi).$ 


Thus 16.52 holds when $\phi = \frac{1}{2}\pi$, again when $\phi = \frac{1}{4}\pi$, and so on; that is, it 
holds when $\phi$ is $\pi$ divided by any power of 2. Appealing again to VI, we 
deduce that $f(\phi) = \mu\phi$ whenever $\phi = n\pi$, where n is a number which terminates 
when expressed as a “decimal” in the scale of 2 [cf. Coxeter 3, p. 
102]. For brevity, let us call this a binary number. 

Suppose, if possible, that, for some particular value of $\phi$, $f(\phi) \ne \mu\phi$. 
Choose a binary number n between the two distinct real numbers $\phi/\pi$ and 
$f(\phi)/\mu\pi$. If $f(\phi) > \mu\phi$, so that 


$\phi < n\pi < f(\phi)/\mu,$ 


we have, since $f(\phi)$ is an increasing function, 


$f(\phi) < f(n\pi) = \mu n\pi < f(\phi),$ 


which is absurd. If, on the other hand, $f(\phi) < \mu\phi$, we can argue the same way 
with all the inequalities reversed. Hence, in fact, $f(\phi) = \mu\phi$ for all the values 
of $\phi$ (from 0 to $\pi$). 
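
The algebraic step that yields VI from IV and V, together with the first halving used in Lindsay's argument, can be written out as follows. This is only a worked restatement, with $\phi$ and $\psi$ denoting the two angles, t the area of a trebly asymptotic triangle, and $\mu = t/\pi$ as in 16.52; nothing beyond IV, V, and 16.51 is assumed.

```latex
\begin{align*}
f(\phi+\psi) + f(\pi-\phi-\psi) &= t
  && \text{(IV, with $\phi+\psi$ in place of $\phi$)}\\
f(\phi) + f(\psi) + f(\pi-\phi-\psi) &= t
  && \text{(V)}\\
f(\phi) + f(\psi) &= f(\phi+\psi)
  && \text{(subtracting: this is VI)}\\
2f\!\left(\tfrac{1}{2}\pi\right) &= f(\pi) = t = \mu\pi
  && \text{(VI with $\phi=\psi=\tfrac{1}{2}\pi$, then 16.51)}\\
f\!\left(\tfrac{1}{2}\pi\right) &= \mu\cdot\tfrac{1}{2}\pi
  && \text{(so 16.52 holds at $\tfrac{1}{2}\pi$)}
\end{align*}
```

Repeating the last two lines with $\frac{1}{2}\pi$ in place of $\pi$ gives the value at $\frac{1}{4}\pi$, and so on through every power of 2.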


Figure 16.5c        Figure 16.5d 


VII. The area $\Delta$ of any triangle ABC (with finite sides) is a constant multiple 
of its angular defect: 


$\Delta = \mu(\pi - A - B - C).$ 


For this final step, Gauss exhibited ABC as part of a trebly asymptotic 
triangle by extending its sides in cyclic order, as in Figure 16.5d. The remaining 
parts are doubly asymptotic triangles whose areas are $\mu A$, $\mu B$, $\mu C$. 
Hence 


$\Delta + \mu A + \mu B + \mu C = t = \mu\pi,$ 


and the desired formula follows at once. 


If we wish, we can follow Lobachevsky in using such a unit of measurement* 
that the area of a trebly asymptotic triangle is $\pi$. Then $\mu = 1$, and 
the formula is simply 


16.53    $\Delta = \pi - A - B - C.$ 


This is strikingly reminiscent of the formula 6.92, which tells us that the 
area of a spherical triangle drawn on a sphere of radius R is 


(A + B+ C — w)R®. 


In fact, setting R? = —1, we find that Gauss’s result agrees formally with 
the area of a triangle drawn on a sphere of radius 7. Long before the time 
of Gauss, it was suggested by J. H. Lambert (1728-1777) that, if a non- 
Euclidean plane exists, it should resemble a sphere of radius i. This analogy 
enabled him to derive the formulas of hyperbolic trigonometry (which were 
later developed rigorously by Lobachevsky) from the classical formulas of 
spherical trigonometry. Its full significance did not appear till Minkowski 
(1864-1909) discovered the geometry of space-time, which provided a geo- 
metrical basis for Einstein’s special theory of relativity. We know now that, 
in a (2 + 1)-dimensional space-time, the hyperbolic plane can be repre- 
sented without distortion on either sheet of a sphere of time-like radius. In 
the underlying affine space, this kind of sphere is a hyperboloid of two 
sheets.} 
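The formal agreement between 16.53 and the spherical formula can be checked numerically. The following is a minimal sketch, assuming angles in radians and the unit of measurement for which μ = 1; the function names and the sample triangle with angles π/2, π/3, π/7 are illustrative choices, not taken from the text.

```python
import math


def hyperbolic_area(A, B, C):
    """Angular defect pi - A - B - C, i.e. formula 16.53 with mu = 1."""
    return math.pi - A - B - C


def spherical_area(A, B, C, R2=1.0):
    """Angular excess times R^2; R2 is the squared radius (it may be set to -1)."""
    return (A + B + C - math.pi) * R2


if __name__ == "__main__":
    angles = (math.pi / 2, math.pi / 3, math.pi / 7)  # hypothetical sample triangle
    print(hyperbolic_area(*angles))           # the angular defect
    print(spherical_area(*angles, R2=-1.0))   # the same number, from R^2 = -1
    print(hyperbolic_area(0.0, 0.0, 0.0))     # trebly asymptotic triangle: area pi
```

Setting all three angles to zero recovers the area π of the trebly asymptotic triangle, in agreement with the choice of unit made above.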
EXERCISES 


1. Gauss’s formula 16.53 remains valid when the triangle has one or more zero 
angles. 


2. The area of any simple p-gon is equal to its angular defect: the amount by which 
its angle sum falls short of that of a p-gon in the Euclidean plane. (Hint: Dissect the 
polygon into triangles. Of course, we are now assuming μ = 1.) In Figure 16.4a, the
area of ABM is equal to that of ABGF,A. 


3. The product of three translations along the directed sides of a triangle (through 
the lengths of these sides themselves) is a rotation through the angular defect of the 
triangle. (These translations are half as long as those in Donkin’s theorem, 15.31.) 
[Lamb 1, p. 7.] 

4. The product of half-turns about the midpoints of the sides of a simple quadrangle 
(in their natural order) is a rotation through the angular defect of the quadrangle. 

5. Any polygon whose angle sum is a submultiple of 2π can be repeated, by
half-turns about the midpoints of its sides, so as to cover the whole plane
without interstices [cf. Sommerville 1, p. 86, Ex. 15]. (Hint: See Figures 4.2b and c.)

* Coxeter, Hyperbolic triangles, Scripta Mathematica, 22 (1956), p. 9.

† Coxeter, A geometrical background for de Sitter’s world, American Mathematical Monthly, 50
(1943), p. 220.


16.6 CIRCLES, HOROCYCLES, AND EQUIDISTANT CURVES 


A circle is the orthogonal trajectory of a pencil of lines with a real vertex...
A horocycle is the orthogonal trajectory of a pencil of parallel lines.... The
orthogonal trajectory of a pencil of lines with an ideal vertex ... is called an
equidistant-curve.


D. M. Y. Sommerville (1879-1934)
[Sommerville 1, pp. 51-52]


By 15.26, any two distinct lines are either intersecting, parallel, or ultra- 
parallel. In other words, they belong to a pencil of lines of one of three 
kinds: an ordinary pencil, consisting of all the lines through one point, a 
pencil of parallels, consisting of all the lines parallel to a given ray, or a pen- 
cil of ultraparallels, consisting of all the lines perpendicular to a given line. 
By 15.32, the product of reflections in the two lines is a rotation, a parallel 
displacement, or a translation, respectively. By fixing one of the two lines 
and allowing the other to vary in the pencil, we see that each of these three 
kinds of direct isometry can be applied as a continuous motion. 

A circle with center O may be defined either as in § 15.1 or to be the locus
of a point P which is derived from a fixed point Q (distinct from O) by
continuous rotation about O, or to be the locus of the image of Q by reflection in
all the lines through O. When the radius OQ becomes infinite, we have a
horocycle with center M (at infinity): the locus of a point which is derived
from a fixed point Q by a continuous parallel displacement, or the locus of
the image of Q by reflection in all the lines parallel to the ray QM [Coxeter
3, p. 213]. The rays parallel to QM are called the diameters of the
horocycle. The first “o” in the word “horocycle” is short, as in “horror.”

The locus of a point at a constant distance from a fixed line o is not a pair
of parallel lines, as it would be in the Euclidean plane, but an equidistant
curve (or “hypercycle”), having two branches, one on each side of its axis o.
Either branch may be described as the locus of a point which is derived from
a fixed point Q (not on o) by continuous translation along o, or as the locus of
the image of Q by reflection in all the lines perpendicular to o.

Orthogonal to the pencil of lines through O we have a pencil of concentric
circles. A rotation about O permutes the lines and slides each circle along
itself. Orthogonal to the pencil of parallels with a common end M we have a
pencil of concentric horocycles. A parallel displacement with center M
permutes the parallel lines and slides each horocycle along itself. Orthogonal to
the pencil of ultraparallels perpendicular to o we have a pencil of coaxal
equidistant curves. A translation along o permutes the ultraparallel lines and
slides each equidistant curve along itself.

We are now ready to fulfill the promise, made after 15.31, to show that
“the product of two translations with nonintersecting axes may be a rotation.”
Referring to Figure 16.3d, we see that the line through C perpendicular to AB
is ultraparallel to AM and AN. Therefore, it has a common perpendicular
GH with AM, and a common perpendicular FE with AN, forming a
pentagon AEFGH with right angles at E, F, G, H as in Figure 16.6a. The
remaining angle (at A) may be as small as we please; if it is zero, the pentagon
is “asymptotic.” The product of reflections in AE and FG is a translation
along EF (through 2EF). The product of reflections in FG and AH is a
translation along GH (through 2GH). Hence the product of these two
translations is the same as the product of reflections in AE and AH, which is a
rotation or, if A is an “end,” a parallel displacement. Since the axes of the two
translations are both perpendicular to FG, we have thus proved that the
product of translations along two ultraparallel lines may be either a rotation
or a parallel displacement. (Of course, it may just as easily be another
translation.)


Figure 16.6a Figure 16.6b 


The product of translations along two parallel lines, AM and BM, leaves
invariant the common end M; therefore it cannot be a rotation, but must
be either a translation along another line through M or a parallel
displacement with center M. We shall soon see that the latter possibility arises when
the two given translations are of equal length, one towards M and the other
away from M. In fact, the translation along AM from A to A’ (Figure 16.6b)
transforms the arc AB of a horocycle through A into an equal arc A’B’ of the
concentric horocycle through A’. Let B₀ denote the point in which the latter
arc is cut by the diameter through B. The translation along this diameter
from B₀ to B transforms the arc B₀A’ of the second horocycle into the equal
arc BA” of the first. Thus the product of the two translations is the parallel
displacement that transforms the arc AB into A”B’; it slides this horocycle
(and every concentric horocycle) along itself.


EXERCISES 


1. The three vertices of a (finite) triangle all lie on each of three equidistant curves,
whose axes join midpoints of pairs of sides, and on a fourth “cycle,” which may be either
a circle or a horocycle or another equidistant curve (with all three vertices on one
branch). [Sommerville 1, pp. 54, 189.]

2. The three sides of a (finite) triangle all touch a circle (the incircle) and three other
“cycles,” each of which may be of any one of the three kinds.

3. In Figure 15.2a, the horocycle through J with diameter p, passes also through L. 

4. How many horocycles will pass through two given points? 

5. An equidistant curve may have as many as four intersections with a circle or a 
horocycle or another equidistant curve. 

6. Develop the analogy between conics in the affine plane and generalized circles
in the hyperbolic plane. A horocycle, like a parabola, goes to infinity in one direction: if
the points P and Q on it are variable and fixed, respectively, the limiting position of
the line QP is the diameter through Q. An equidistant curve, like a hyperbola, has
two branches.

7. Unlike the conjugate axis of a hyperbola, the axis of an equidistant curve is on 
the concave side of each branch. 


16.7. POINCARE’S “HALF-PLANE” MODEL 


There is a gain in simplicity when the fundamental circle is taken as a 
straight line, say the axis of x... . We may avoid dealing with pairs 
of points by considering only those points above the x-axis. A proper 
circle is represented by a circle lying entirely above the x-axis; a 
horocycle by a circle touching the x-axis; an equidistant-curve by the 
upper part of a circle cutting the x-axis together with the reflexion of 
the part which lies below the axis. 


D. M. Y. Sommerville [1, pp. 188-189]


From the conformal model (Figure 16.2a) in which the lines are
represented by circles (and lines) orthogonal to a fixed circle Ω, Poincaré derived
another conformal model by inversion in a circle whose center lies on Ω. The
inverse of Ω is a line, say a “horizontal” line, which we shall again denote by
Ω. The points of the hyperbolic plane are represented by pairs of points
which are images of each other by reflection in Ω, and the lines are
represented by circles and lines orthogonal to Ω, that is, circles whose centers lie
on Ω, and vertical lines [Burnside 1, p. 387].

Through a pair of points which are images in Ω, we can draw an
intersecting pencil of coaxal circles (like Figure 6.5a turned through a right angle)
representing an ordinary pencil of lines. The orthogonal nonintersecting
pencil, having Ω for its radical axis, represents a pencil of concentric circles.
The limiting points of the nonintersecting pencil represent the common
center of the concentric circles.

Another pencil of circles (situated as in Figure 6.5a itself) can be drawn
through two points on Ω. One member of this pencil, having its center on
Ω, represents a line o. The remaining circles (or strictly, pairs of them
related by reflection in Ω) represent coaxal equidistant curves with axis o. For,
the orthogonal nonintersecting pencil represents the pencil of ultraparallel
lines perpendicular to o.

A tangent pencil of circles whose centers lie on Ω (Figure 6.5b) represents
a pencil of parallels, whereas the orthogonal tangent pencil (touching Ω)
represents a pencil of concentric horocycles. One particular pencil of parallels
(special in the model but, of course, not special in the hyperbolic geometry
itself) is represented by all the vertical lines (which pass, like Ω itself, through
the point at infinity of the inversive plane). The horocycles having these
lines for diameters are represented by all the horizontal lines except Ω (or
strictly, pairs of such lines related by reflection in Ω). Since reflections in
the vertical lines represent reflections in the parallel lines, horizontal
translations represent parallel displacements. Hence the horizontal lines (other
than Ω itself) represent the horocycles isometrically; equal segments
represent equal arcs.
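The statement that horizontal translations represent isometries can be tested numerically. The sketch below assumes the familiar distance formula cosh d = 1 + |z − w|²/(2 Im z Im w) for the upper half-plane, with the unit of length fixed so that the curvature is −1; that formula and the function names are assumptions brought in for illustration and are not derived in the text.

```python
import math


def hdist(z, w):
    """Hyperbolic distance between two points (complex numbers with Im > 0).

    Assumes the standard half-plane formula with curvature -1.
    """
    return math.acosh(1 + abs(z - w) ** 2 / (2 * z.imag * w.imag))


def translate(z, h):
    """Horizontal Euclidean translation through h (represents a parallel displacement)."""
    return z + h


if __name__ == "__main__":
    z, w = complex(0.3, 1.2), complex(-1.1, 0.4)
    print(hdist(z, w))
    print(hdist(translate(z, 2.5), translate(w, 2.5)))  # agrees with the line above
```

Because the formula involves only the difference z − w and the two heights, any horizontal shift leaves it unchanged, which is the numerical counterpart of sliding each horocycle along itself.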


EXERCISES 


1. What figure is represented by two lines forming an angle that is bisected by Ω?

2. When two ultraparallel lines are represented by nonintersecting circles (in either 
of Poincaré’s conformal models), the distance between the lines, measured along their 
common perpendicular, appears as the inversive distance between the circles (see Ex- 
ercise 5 of § 6.6). 

3. The angle of parallelism (Figure 16.3d on page 293) is 


Π(x) = 2 arctan e^(−x).


16.8 THE HOROSPHERE AND THE EUCLIDEAN PLANE 


F. L. Wachter (1792-1817) ... in a letter to Gauss (Dec., 1816) ...
speaks of the surface to which a sphere tends as its radius approaches
infinity ... He affirms that even in the case of the Fifth Postulate being
false, there would be a geometry on this surface identical with that of
the ordinary plane.


R. Bonola [1, pp. 62-63]


The ideas in §§ 16.6 and 16.7 extend in an obvious manner from two to
three dimensions. The locus of images of a point Q by reflection in all the
planes through a point O is a sphere with radius OQ. As a limiting case we
have a horosphere with center M (at infinity): the locus of images of a point Q
by reflection in all the planes parallel to the ray QM [Coxeter 3, p. 218]. The
locus of images of a point Q by reflection in all the planes perpendicular to a
fixed plane w is one sheet of an equidistant surface, which consists of points at
a constant distance from w on either side.

There is a conformal model in inversive space in which the points of
hyperbolic space are represented by pairs of points related by reflection in
a fixed “horizontal” plane Ω, and the planes are represented by spheres and
planes orthogonal to Ω, that is, spheres whose centers lie on Ω, and vertical
planes. The representation of lines (which are intersections of planes)
follows immediately. Of particular interest is the bundle of vertical lines,
which represents the bundle of lines parallel to a given ray QM (special in
the model, though not in the hyperbolic geometry itself). The horospheres
that have these lines for diameters are represented by all the horizontal
planes except Ω. Since every vertical plane provides a model (of the kind
described in § 16.7) for a plane in the hyperbolic space, each horizontal
plane (except Ω) represents a horosphere, and every line in the plane
represents a horocycle on the horosphere. Since distances along such lines agree
with distances along the corresponding horocycles, the representation of the
horosphere by the Euclidean plane is isometric: for any figure in the
Euclidean plane there is a congruent figure on the horosphere (with lines replaced
by horocycles).

This astonishing theorem was discovered independently by Bolyai and Lo- 
bachevsky. For two different proofs, see Coxeter [3, pp. 197, 251]. It means 
that, along with ordinary spherical geometry, the inhabitants of a hyperbolic 
world would also study horospherical geometry, which is the same as Eucli- 
dean geometry! 


Part IV 


17 


Differential geometry of curves 


Differential geometry is concerned with applying the methods of analysis
to geometry, especially to the study of curves and surfaces. Classically,
the study is made in Euclidean space of three dimensions. But in the
twentieth century other spaces, such as inversive, affine, or projective, have been
used. In other words, differential geometry is still significant when there
is no concept of distance. However, both distance and parallelism are
usually present, in which case the notion of a vector is fundamental.

A curve, being the locus of a point P, is intimately associated with a varia- 
ble vector, namely the position vector 


    →
r = OP


which goes from a fixed origin O to the point P. For simplicity we shall con- 
sider only rectifiable curves for which there is a well-defined tangent at each 
point [Kreyszig 1, p. 29]. 

After a preliminary discussion of vectors, we shall consider the curvature 
of plane curves, and the curvature and torsion of twisted curves, applying 
the results to many important special cases such as spirals and helices. 


17.1. VECTORS IN EUCLIDEAN SPACE 


We have already considered, in § 13.6, the affine properties of vectors, such
as addition and subtraction, multiplication by numbers, independence, and
the unique expression


17.11        c = xe + yf + zg


for any vector c as a linear combination of three basic vectors e, f, g. The
time has now come to introduce two kinds of multiplication of vectors by
one another. We shall employ the notation of J. W. Gibbs (1839-1903),
although some authors, such as Birkhoff and MacLane [1, p. 175] and Forder
[2], prefer that of H. Grassmann (1809-1877).


Euclidean geometry allows us to speak of the length (or “magnitude,” or
“absolute value”) |a|, of any given vector a. If θ is the angle between a
and another vector b, we define the inner (or “scalar”) product a · b and the
outer (or “vector”) product a × b by the formulas


a · b = |a| |b| cos θ,        a × b = |a| |b| sin θ g,


where g is the unit vector orthogonal to the plane ab on the side from which
θ appears as a positive angle. The introduction of the auxiliary vector g
(orthogonal to both a and b) is justified by the elegant algebra that follows.


We see at once that, if m and n are numbers,

ma · nb = mn a · b,        ma × nb = mn a × b,
b · a = a · b,        b × a = −a × b.


Thus inner multiplication is commutative (like the multiplication of
numbers), whereas outer multiplication is “anticommutative.” Since a × a = 0,
we naturally take a² to mean a · a:


a² = |a|².


Two vectors a and b are orthogonal if a · b = 0, parallel if a × b = 0.
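These rules are easy to try out on numerical vectors. The following is a minimal sketch, assuming vectors are stored as triples of rectangular Cartesian components and using the familiar component formulas for the two products; the helper names dot, cross and norm are illustrative.

```python
import math


def dot(a, b):
    """Inner (scalar) product of two 3-component vectors."""
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    """Outer (vector) product in rectangular Cartesian components."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def norm(a):
    """Length |a|, so that norm(a) ** 2 agrees with dot(a, a)."""
    return math.sqrt(dot(a, a))


if __name__ == "__main__":
    a, b = (1.0, 2.0, 3.0), (-2.0, 0.5, 4.0)
    theta = math.acos(dot(a, b) / (norm(a) * norm(b)))   # angle between a and b
    print(norm(cross(a, b)), norm(a) * norm(b) * math.sin(theta))  # equal lengths
    print(cross(a, b), cross(b, a))    # opposite vectors: anticommutativity
    print(dot(a, cross(a, b)))         # zero: a x b is orthogonal to a
```

The first printed pair compares |a × b| with |a| |b| sin θ, and the last line illustrates the orthogonality test a · b = 0 applied to a and a × b.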


                                →        →
Consider two vectors a = OA, b = OB, and let BN be the perpendicular
from B to OA, as in Figure 17.1a. The algebraic distance ON (negative if
∠AOB is obtuse) is called the projection of b on a. If |a| = 1, so that a is
a unit vector, this projection is clearly a · b. Removing the restriction to a
unit vector, we find that a · b is |a| times the projection. This geometrical
interpretation makes it easy to establish, for inner products, the distributive
law


a · (b + b’) = a · b + a · b’,

which may also be expressed as

(b + b’) · a = b · a + b’ · a


in virtue of the commutative law b · a = a · b. Replacing b’ by −b’, we
obtain the corresponding results for differences instead of sums.

The distributive law provides a useful method for establishing certain
identities. If b and b’ stand for expressions which we wish to prove equal, it is
sometimes helpful to introduce an arbitrary vector c and to compare b · c
with b’ · c. If we find that


b · c = b’ · c


for all choices of c (or even for three independent c’s), we can safely assert
that b = b’. For, since (b − b’) · c = 0, if b − b’ is not the zero vector it
must be orthogonal to c; and, since c is arbitrary, this is impossible.


Figure 17.1a          Figure 17.1b


As a step towards establishing the distributive law for outer products, we
compare two expressions for the area of a parallelogram, namely

|a × b| = |a| b · f,


where f is a unit vector orthogonal to a in the plane ab (as in Figure 17.1b) so
that b · f is the altitude of the parallelogram from its base |a|. Analogously,
the parallelepiped formed by three independent vectors a, b, c has base
|a × b|, altitude c · g, and volume (with a suitable convention of sign,
depending on whether the trihedron abc is positively or negatively oriented)

|a × b| c · g = |a × b| g · c = (a × b) · c.


Since we could just as well regard another face of the parallelepiped as its
base, the same volume is expressible as

(b × c) · a = a · (b × c).

Thus we can interchange the cross and the dot:

(a × b) · c = a · (b × c).


(This is as near as we can come to an “associative law” for products of
vectors.) Since the dot and cross are interchangeable, it is convenient to use,
for (a × b) · c or a · (b × c), the special symbol [a b c], so that the volume
of the parallelepiped is


[a b c] = [b c a] = [c a b] = −[c b a].


If [a b c] = 0, the parallelepiped collapses, and the three vectors are
coplanar, that is, dependent. Thus a necessary and sufficient condition for
a, b, c to be independent is


[a b c] ≠ 0.
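The behavior of the symbol [a b c] under permutation, and its use as a test for independence, can be illustrated as follows. This is a sketch only, with vectors stored as integer triples so that the arithmetic is exact; the function names and the sample vectors are illustrative.

```python
def dot(a, b):
    """Inner product of two 3-component vectors."""
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    """Outer product in rectangular Cartesian components."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def triple(a, b, c):
    """The symbol [a b c] = (a x b) . c, the signed volume of the parallelepiped."""
    return dot(cross(a, b), c)


def independent(a, b, c):
    """Three vectors are independent exactly when [a b c] is not zero."""
    return triple(a, b, c) != 0


if __name__ == "__main__":
    a, b, c = (1, 2, 3), (0, 1, 4), (5, 6, 0)
    print(triple(a, b, c), triple(b, c, a), triple(c, a, b))  # cyclic: all equal
    print(triple(c, b, a))                                    # reversed: sign changes
    coplanar = tuple(x + 2 * y for x, y in zip(a, b))         # a + 2b lies in the plane ab
    print(independent(a, b, c), independent(a, b, coplanar))
```

Here the third vector a + 2b is deliberately chosen in the plane ab, so the parallelepiped collapses and the symbol vanishes.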


To prove the distributive law for outer products, we introduce an
arbitrary vector c (like a catalyst) and find

{(a + a’) × b} · c = (a + a’) · (b × c)
                   = a · (b × c) + a’ · (b × c)
                   = (a × b) · c + (a’ × b) · c
                   = {(a × b) + (a’ × b)} · c.


Since c is arbitrary, we conclude that

(a + a’) × b = (a × b) + (a’ × b).


Since the outer product of two vectors is a vector, we might at first
expect the associative law to hold. To see why (a × b) × c and a × (b × c)
are, in general, different, let us evaluate both expressions, using a procedure
devised by Coe and Rainich.* Consider unit vectors e and f in the plane
ab, orthogonal to b and a respectively, as in Figure 17.1b. Since the
vector a × b is perpendicular to the plane ab (or ef), the two vectors


(a × b) × e,        (a × b) × f


lie in this plane and have the same length |a × b|, which may be expressed
in either of the forms


a · e |b|,        b · f |a|.

Since they have the same directions as b, −a, respectively, they are exactly

(a × b) × e = (a · e) b,        (a × b) × f = −(b · f) a.

If g is perpendicular to the plane, we have also

(a × b) × g = 0.

Using the three vectors e, f, g as a basis, we may express an arbitrary
vector c in the form 17.11; thus


(a × b) × c = (a × b) × (xe + yf + zg)
            = x(a × b) × e + y(a × b) × f + z(a × b) × g
            = x(a · e)b − y(b · f)a = (a · xe)b − (b · yf)a
            = a · (xe + yf + zg) b − b · (xe + yf + zg) a,

since a · f, a · g, b · e, b · g are all zero. Hence, finally,

17.12        (a × b) × c = (a · c) b − (b · c) a.

Interchanging a and c, we deduce

a × (b × c) = (c × b) × a = (c · a) b − (b · a) c
            = (a · c) b − (a · b) c.


By considering {(a × b) × c} · d, we find also that any four vectors a, b, c,
d satisfy Lagrange’s identity


17.13        (a × b) · (c × d) = (a · c)(b · d) − (b · c)(a · d).
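Formula 17.12, its companion for a × (b × c), and Lagrange’s identity 17.13 can all be confirmed on a numerical example. The sketch below works with integer triples so that each equality is exact; the helper names and the four sample vectors are illustrative.

```python
def dot(a, b):
    """Inner product of two 3-component vectors."""
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    """Outer product in rectangular Cartesian components."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def combine(m, a, n, b):
    """The linear combination m a + n b of two vectors."""
    return tuple(m * x + n * y for x, y in zip(a, b))


if __name__ == "__main__":
    a, b, c, d = (1, 2, 3), (0, 1, 4), (5, 6, 0), (2, -1, 1)
    left = cross(cross(a, b), c)
    right = cross(a, cross(b, c))
    print(left, combine(dot(a, c), b, -dot(b, c), a))    # 17.12: both sides agree
    print(right, combine(dot(a, c), b, -dot(a, b), c))   # the companion formula
    print(left == right)                                  # False: no associative law
    print(dot(cross(a, b), cross(c, d)),
          dot(a, c) * dot(b, d) - dot(b, c) * dot(a, d))  # 17.13: both sides agree
```

The two triple products differ for these vectors, which is the failure of associativity that 17.12 makes precise.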


* C. J. Coe and G. Y. Rainich, American Mathematical Monthly, 56 (1949), pp. 175-176.

It is sometimes desirable to express a vector in terms of its components
in the directions of the axes of rectangular Cartesian coordinates, that is, to
let the coordinate symbol (x, y, z) for the point P be used also for the
vector OP, where O is the origin (0, 0, 0). In other words, we use P =
(x, y, z) as an abbreviation for


17.14        r = xi + yj + zk,


where i, j, k are unit vectors along the three axes (so that this is a special case
of 17.11). Since


17.15        i² = j² = k² = 1,        j · k = k · i = i · j = 0,
             i = j × k,    j = k × i,    k = i × j,    [i j k] = 1,


we easily deduce, for any three vectors r, r’, r”, the products


17.151       r · r’ = xx’ + yy’ + zz’,

                      | y   z  |       | z   x  |       | x   y  |
             r × r’ = |        | i  +  |        | j  +  |        | k
                      | y’  z’ |       | z’  x’ |       | x’  y’ |

and

                          | x   y   z  |
17.16        [r r’ r”] =  | x’  y’  z’ | .
                          | x”  y”  z” |


Since the product of two determinants (like the product of two matrices)
is obtained by writing down the inner products of the rows of the first with
the columns of the second, we can bring in three more vectors such as
q = ui + vj + wk and find


                         | u   v   w  |   | x   x’  x” |
[q q’ q”] [r r’ r”]  =   | u’  v’  w’ |   | y   y’  y” |
                         | u”  v”  w” |   | z   z’  z” |

                         | q · r    q · r’    q · r”  |
17.17                =   | q’ · r   q’ · r’   q’ · r” | .
                         | q” · r   q” · r’   q” · r” |

Returning to 17.14, we observe that

r · i = x,        r · j = y,        r · k = z.


Thus we can express any vector r in terms of any orthogonal trihedron of unit
vectors in the form


17.18        r = (r · i) i + (r · j) j + (r · k) k.


We shall also have occasion to use the following theorem:

17.19 If two vectors, a and b, lie in perpendicular planes which intersect
in the line of a unit vector k, then

(a · k)(k · b) = a · b.

Proof. Since the planes ak and kb are perpendicular, 17.13 yields

0 = (a × k) · (k × b) = (a · k)(k · b) − k²(a · b) = (a · k)(k · b) − (a · b).


EXERCISES 


1. How must a, b, c be related in order to satisfy the associative law (a × b) × c
= a × (b × c)?

2. Simplify (a × b) × (c × d) two ways and, by equating the results, deduce an
identity connecting four vectors such as [a b c] d.


3. Simplify (a × b) · (a × b), and show that the result could have been anticipated
in virtue of a well known trigonometrical identity.


17.2 VECTOR FUNCTIONS AND THEIR DERIVATIVES 


Vector functions can be differentiated in the same manner as numerical
functions. Let the vector

a = a(s)

be a function of the numerical variable s, and let Δa be the increment in
the vector corresponding to the increment Δs in the variable s, so that


a(s + Δs) = a + Δa.


If the vector Δa/Δs tends to a limit as Δs tends to zero, the vector function
a(s) is said to be differentiable, and the limit is the derivative:


ȧ = da/ds = lim Δa/Δs = lim {a(s + Δs) − a(s)}/Δs        (Δs → 0).


The rule for differentiating a product is the same as for ordinary functions.
In fact,


(a + Δa) · (b + Δb) − a · b = a · Δb + Δa · b + Δa · Δb


and therefore

d/ds (a · b) = a · ḃ + ȧ · b + lim (ȧ · ḃ) Δs
             = a · ḃ + ȧ · b.


Similarly,        d/ds (ma) = ṁa + mȧ


and               d/ds (a × b) = (ȧ × b) + (a × ḃ).

(Since outer products are anticommutative, we must be careful not to write
the second term on the right as ḃ × a.)

Since

d/ds (a²) = 2a · ȧ,

a variable vector of constant length is always orthogonal to its derivative.
Since the Cartesian basic vectors i, j, k are constant, their derivatives are
zero, and


ṙ = d/ds (xi + yj + zk) = ẋi + ẏj + żk = (ẋ, ẏ, ż).


Thus, when we differentiate a vector, the components of the derivative are
simply the derivatives of the components.
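Both remarks (differentiation by components, and the orthogonality of a vector of constant length to its derivative) can be observed numerically. The sketch below replaces the limit by a central difference quotient with a small step h, which is an approximation chosen for convenience rather than the one-sided quotient used in the definition; the sample unit vector a(s) and the function names are illustrative.

```python
import math


def a(s):
    """A sample vector function of constant length 1 (a point moving on the unit sphere)."""
    return (math.sin(s) * math.cos(2 * s),
            math.sin(s) * math.sin(2 * s),
            math.cos(s))


def derivative(f, s, h=1e-5):
    """Central-difference estimate of the derivative, taken component by component."""
    forward, backward = f(s + h), f(s - h)
    return tuple((p - m) / (2 * h) for p, m in zip(forward, backward))


def dot(u, v):
    """Inner product of two 3-component vectors."""
    return sum(x * y for x, y in zip(u, v))


if __name__ == "__main__":
    s = 0.8
    da = derivative(a, s)
    print(da)              # third component is close to -sin(s), the derivative of cos(s)
    print(dot(a(s), da))   # close to zero: a is orthogonal to its derivative
```

The inner product is not exactly zero, only small, because the difference quotient stops short of the limit.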


EXERCISE 
When a particle moves in a circular orbit (like a stone swung at the end of a string),
its position vector from the center has constant length. In which direction is its
velocity? If its speed is constant, its velocity is a vector of constant length. In which
direction is its acceleration?


17.3. CURVATURE, EVOLUTES AND INVOLUTES 


The simplest instance of a variable vector is the position vector r = OP
of a point P that moves along a curve (including a straight line as the simplest
case of all). To define the length of an arc of the curve, we approximate it by
a sequence of broken lines, as in § 8.5. The increment Δr may be identified
with the vector along one of the segments of the broken line, so that, before
we pass to the limit, the corresponding increment of arc is the length |Δr|.

For most purposes, the directed arc s (measured along the curve from a
fixed point A to a variable point P) is the most convenient parameter to
use in describing the curve. That is, we regard the vector r = OP as a
function of s. Since


lim |Δr/Δs| = lim {(Δx/Δs)² + (Δy/Δs)² + (Δz/Δs)²}^(1/2) = (ẋ² + ẏ² + ż²)^(1/2) = 1

[Struik 1, p. 7], the limit of Δr/Δs is the unit tangent vector

t = ṙ.


If another parameter u is used instead of s, we can easily make the
necessary adjustment. The derivative dr/du is still a tangent vector, namely


dr/du = (dr/ds)(ds/du) = (ds/du) t;


the connection between s and u is determined by the length, ds/du, of this
vector, and t is the unit vector in the same direction.

For instance, in the case of the circle

x = ρ cos u,        y = ρ sin u,

of radius ρ, we have


r = ρ(cos u, sin u),

(ds/du) t = ρ(−sin u, cos u),

ds/du = ρ,        t = (−sin u, cos u).


For any curve in the (x, y)-plane, the tangent vector is


17.31        t = (cos ψ, sin ψ),


where ψ is the angle that t makes with the vector i along the x-axis. The
curvature of the plane curve is the arc derivative of this angle:


17.32        κ = dψ/ds = ψ̇.

Since t is a unit vector, its derivative is in the perpendicular direction, that
is, in the direction of the unit normal vector n = (−sin ψ, cos ψ), which is
derived from t by a positive quarter-turn. Thus

             ṫ = ψ̇(−sin ψ, cos ψ)
17.33          = κn,


and we regard κ as being positive or negative according as n is on the
concave or convex side of the curve.

The derivative of n, being orthogonal to n, is a certain multiple of t. By
differentiating the inner product n · t, which is zero, we find the precise
expression


17.34        ṅ = −κt.

Applying this method to the circle

r = ρ(cos u, sin u),

for which t = (−sin u, cos u), we find


κn = ṫ = u̇(−cos u, −sin u),

whence

κ = u̇ = 1/ρ        and        n = −(cos u, sin u).


This means that the curvature of the circle is the reciprocal of its radius,
Soddy’s “bend” (p. 15), and its normal is towards the center along the radius.
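When a curve is given in terms of an arbitrary parameter u, the definition 17.32 leads to the familiar expression κ = (x′y″ − y′x″)/(x′² + y′²)^(3/2), where primes denote derivatives with respect to u. That expression is a standard consequence of 17.31 and 17.32 rather than a formula quoted from the text, and the function names below are illustrative; the sketch simply confirms that it returns 1/ρ for the circle.

```python
import math


def curvature(d1, d2):
    """Signed curvature from the first and second parametric derivatives.

    d1 = (x', y') and d2 = (x'', y''), both taken with respect to any parameter u.
    """
    (x1, y1), (x2, y2) = d1, d2
    return (x1 * y2 - y1 * x2) / math.hypot(x1, y1) ** 3


def circle_derivatives(rho, u):
    """First and second derivatives of r = rho (cos u, sin u) with respect to u."""
    first = (-rho * math.sin(u), rho * math.cos(u))
    second = (-rho * math.cos(u), -rho * math.sin(u))
    return first, second


if __name__ == "__main__":
    rho = 2.5
    for u in (0.0, 0.9, 2.0):
        print(curvature(*circle_derivatives(rho, u)), 1 / rho)  # the two values agree
```

The result is independent of u, as it must be for a curve that slides along itself under rotation.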

At a point P on any plane curve, the center of curvature P_c is the center
of the circle of curvature, which is the circle of “closest fit,” having the same
normal and the same curvature. Its “radius”


17.35        ρ = 1/κ


(which we allow to be positive or negative with κ) is the radius of curvature.
The center of curvature at P, being distant ρ from P along the normal there,
has the position vector


17.36        r_c = r + ρn.


When P moves along the given curve (which we now assume not to be a
circle nor a straight line), the center of curvature P_c moves along a related
curve called the evolute,* which may be expressed parametrically in terms
of its own arc length s_c. Its unit tangent t_c is given by


ṡ_c t_c = d/ds (r + ρn) = ṙ + ρṅ + ρ̇n
        = t − ρκt + ρ̇n = ρ̇n.

Since t_c and n are unit vectors, it follows that

ṡ_c = ±ρ̇        and        t_c = ±n:


the tangent at P_c to the evolute is the same line as the normal at P to the
original curve (see Figure 17.3a). Thus the evolute, which we have defined
as the locus of the center of curvature, could equally well be defined as the
envelope of normals.
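As an illustration of 17.36, the center of curvature can be computed directly from the parametric derivatives of a curve. The sketch below uses the parabola y = x²/2 as a sample curve (an illustrative choice, not one treated in the text), for which a short calculation gives the evolute (−u³, 1 + 3u²/2); the curvature expression in terms of derivatives and the function names are likewise assumptions made for the example.

```python
import math


def center_of_curvature(p, d1, d2):
    """Position r + rho n of the center of curvature (17.36).

    p is the point (x, y); d1 and d2 are the first and second derivatives of
    the position with respect to any parameter. The curvature must not vanish.
    """
    (x, y), (x1, y1), (x2, y2) = p, d1, d2
    speed = math.hypot(x1, y1)
    kappa = (x1 * y2 - y1 * x2) / speed ** 3
    normal = (-y1 / speed, x1 / speed)   # tangent turned through a positive quarter-turn
    rho = 1 / kappa
    return (x + rho * normal[0], y + rho * normal[1])


def parabola_center(u):
    """Center of curvature of the sample parabola (u, u^2 / 2) at parameter u."""
    return center_of_curvature((u, u * u / 2), (1.0, u), (0.0, 1.0))


if __name__ == "__main__":
    for u in (-1.0, 0.0, 0.7):
        print(parabola_center(u), (-u ** 3, 1 + 1.5 * u * u))  # the two points agree
```

Differentiating the closed form gives (−3u², 3u) = 3u(−u, 1), a multiple of the normal direction (−u, 1) of the parabola, in agreement with the statement that the tangent to the evolute is the normal to the original curve.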


Figure 17.3a 


Integrating the differential equation ds_c = ±dρ, we find that, for some
constant a,


s_c = a ± ρ.


Regarding the line PP_c as a rigid bar that rolls (without sliding) on the
evolute, we now see that the end P of the bar traces out the original curve. In
other words, the locus of P is an involute of the locus of P_c. We say “an
involute” rather than “the involute” because different choices of the tracing
point on the rolling bar yield an infinite family of “parallel” curves, each of
which is an involute.

By a change of notation (from r_c, s_c, t_c to r, s, t) we can assert that the
position vector of a point which traces out an involute of a given curve is


r + (a − s)t.


* For a full discussion of this subject, see A. Ostrowski, Über die Evoluten von endlichen
Ovalen, Journal für die reine und angewandte Mathematik, 198 (1957), pp. 14-27.


To find t, n, and κ for a particular curve, the procedure that we applied
to a circle (just after 17.34) is usually effective whenever the Cartesian
coordinates are given in terms of a parameter. However, in the case of a
central conic, the best way to obtain the evolute is as the envelope of normals.
(See Ex. 3 at the end of § 8.5.)


Figure 17.3b 


For curves given in terms of polar coordinates, a more geometrical
procedure may be desirable. For instance, to locate the center of curvature P_c
at any point P on the equiangular spiral 8.71, we observe (Figure 17.3b) that
ψ = θ + φ. Since also


dr/ds = cos φ        and        dr/dθ = r cot φ,

we have

κ = dψ/ds = dθ/ds = (dθ/dr)(dr/ds) = (sin φ)/r,

so that


PP_c = ρ = r csc φ.

Thus OP_c is orthogonal to OP [Lamb 2, p. 337] and P_c is (r_c, θ_c) where

r_c = r cot φ,        θ_c = θ + ½π.

Since r = r_c tan φ and θ = θ_c − ½π, the evolute has the equation


r tan φ = aμ^(θ − ½π).

Since

tan φ = μ^(log tan φ / log μ) = μ^(log tan φ / cot φ) = μ^(tan φ log tan φ),

this is equivalent to


r = aμ^(θ − ½π − tan φ log tan φ),


which shows that the evolute is derived from the original spiral by a
suitable rotation. (This result could have been seen from simple geometric
principles, since the dilative rotation that slides the original spiral along itself
must also slide the evolute along itself.)

The spiral is its own evolute if the “suitable rotation” consists of n whole
turns, that is, if there is a positive integer n for which


½π + tan φ log tan φ = 2nπ.

This happens if tan φ satisfies the transcendental equation

x log x = (2n − ½)π.


From a table of natural logarithms we see that there is a unique solution
for each positive integer n. The values n = 1 and n = 2 yield φ = 74° 39’
[Cundy and Rollett 1, p. 64] and φ = 80° 41’. When n increases, φ
approaches 90° and the spiral acquires a smaller and smaller “pitch.”
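Instead of consulting a table, the transcendental equation x log x = (2n − ½)π can be solved by repeated bisection. The following is a minimal sketch, assuming natural logarithms and relying on the fact that x log x increases steadily for x > 1; the function name and the choice of 100 bisection steps are illustrative.

```python
import math


def self_evolute_angle(n, steps=100):
    """Angle phi (in degrees) for which tan(phi) solves x log x = (2n - 1/2) pi."""
    target = (2 * n - 0.5) * math.pi
    lo, hi = 1.0, target + 1.0   # x log x is 0 at x = 1 and exceeds target at x = target + 1
    for _ in range(steps):
        mid = 0.5 * (lo + hi)
        if mid * math.log(mid) < target:
            lo = mid
        else:
            hi = mid
    return math.degrees(math.atan(0.5 * (lo + hi)))


if __name__ == "__main__":
    for n in (1, 2, 3, 10):
        phi = self_evolute_angle(n)
        whole = int(phi)
        print(n, whole, "degrees", round((phi - whole) * 60, 1), "minutes")
```

For n = 1 and n = 2 the computed angles fall within a minute or so of the values quoted above, and larger values of n give angles that creep towards 90°.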
EXERCISES 
1. Find the evolute of the cycloid

    $x = u + \sin u, \quad y = 1 + \cos u.$

(Hint: $\mathbf{t} = (\cos\tfrac{1}{2}u, -\sin\tfrac{1}{2}u)$. A synthetic treatment is given by Lamb
[2, pp. 351-352].)

2. Find the involute of the circle

    $x = \cos u, \quad y = \sin u,$

beginning at the point where u = 0.

3. From "simple geometric principles," the radius of curvature of an equiangular
spiral is proportional to the arc s, measured from the origin. In fact,

    $\rho = s\cot\phi.$


17.4 THE CATENARY 


The catenary is an infinite curve, the idealized shape of a uniform chain 
hanging freely under the action of gravity. The curve evidently lies in a 
plane, which we may take to be the (x, y)-plane with the y-axis vertical, as in 
Figure 17.4a. Let W denote the weight of a unit length of the chain. We
consider the forces acting on arc AP, where A is the lowest point (s = 0)
and P is at distance s, measured along the curve. The tangent $\mathbf{t}$ at P makes
a certain angle $\psi$ with the x-axis $\mathbf{i}$, and the normal $\mathbf{n}$ makes the same angle
with the y-axis $\mathbf{j}$, so that

    $\mathbf{i}\cdot\mathbf{t} = \mathbf{j}\cdot\mathbf{n} = \cos\psi, \quad \mathbf{i}\cdot\mathbf{n} = -\sin\psi.$


Figure 17.4a


By considering various points P on the same chain, we may regard the
inclination $\psi$ as a function of the arc s, or vice versa, while conditions at A
remain constant. The three forces that act on the arc AP are: the tension
T at P, acting along the tangent $\mathbf{t}$; the tension Wa (equivalent to the weight
of a certain length a of the chain) along the tangent $-\mathbf{i}$ at A; and the weight
Ws in the direction $-\mathbf{j}$. Since these three forces are in equilibrium, we have

    $T\mathbf{t} - Wa\,\mathbf{i} - Ws\,\mathbf{j} = \mathbf{0}.$

To eliminate the unknown (and uninteresting) tension T, we take the inner
product with $\mathbf{n}$, obtaining

    $Wa\sin\psi - Ws\cos\psi = 0,$

whence

17.41    $s = a\tan\psi.$


This equation, expressing the arc length as a function of the inclination $\psi$,
is called the intrinsic equation of the catenary. To deduce the Cartesian
equation [cf. Lamb 2, p. 290] we observe that

    $dx = ds\cos\psi, \quad dy = ds\sin\psi$

(Figure 8.52) and make the "Gudermannian substitution"

    $\cosh u = \sec\psi, \quad \sinh u = \tan\psi$

(Figure 17.4b), which implies

    $\sinh u\,du = \sec\psi\tan\psi\,d\psi,$
    $du = \sec\psi\,d\psi.$

Differentiating 17.41, we obtain $ds = a\sec^2\psi\,d\psi$, whence

    $dx = ds\cos\psi = a\sec\psi\,d\psi = a\,du,$
    $dy = ds\sin\psi = a\sec\psi\tan\psi\,d\psi = a\,d(\sec\psi) = a\,d(\cosh u).$

Taking the lowest point A (where s = 0, $\psi$ = 0, and u = 0) to be (0, a), we
deduce

    $x = au, \quad y = a\cosh u$

or, in a single equation,

17.42    $y = a\cosh\frac{x}{a}.$


Figure 17.4b
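
The passage from the intrinsic equation 17.41 to the Cartesian equation 17.42 can be checked numerically in the reverse direction. The sketch below is illustrative only: the function names and the polygonal approximation of the arc are assumptions made for the check, not part of the text. It measures the arc s from the lowest point A along $y = a\cosh(x/a)$ and compares it with $a\tan\psi$, using $\tan\psi = dy/dx = \sinh u$.

```python
import math

def catenary_point(a, u):
    """Point of the catenary x = a*u, y = a*cosh(u)."""
    return a * u, a * math.cosh(u)

def arc_from_lowest_point(a, u, steps=20000):
    """Approximate the arc s from A (u = 0) to parameter u > 0 by a fine polygon."""
    total = 0.0
    x0, y0 = catenary_point(a, 0.0)
    for k in range(1, steps + 1):
        x1, y1 = catenary_point(a, u * k / steps)
        total += math.hypot(x1 - x0, y1 - y0)
        x0, y0 = x1, y1
    return total

def tan_inclination(u):
    """tan(psi) = dy/dx = sinh(u) on the catenary."""
    return math.sinh(u)

a, u = 2.0, 1.3
print(arc_from_lowest_point(a, u), a * tan_inclination(u))  # the two values should agree closely
```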


EXERCISES 


1. A uniform chain OP is held at P and hangs over a smooth peg at A, so placed that
the chain just above A is horizontal and the peg gives it a right-angled bend. If the part
of the chain from A to P is in the position indicated in Figure 17.4a, where is the free
end O?


2. For the catenary, $s = a\sinh u$ and $\rho = a\cosh^2 u$.


3. Deduce 17.42 from $\frac{ds}{dx} = \sec\psi = \left[1 + \left(\frac{s}{a}\right)^2\right]^{1/2}$ and $\frac{dy}{dx} = \frac{s}{a}$.


4. Obtain intrinsic equations for (a) the cycloid $x = u + \sin u$, $y = \cos u$; (b) the
parabola $y^2 = 2lx$.


5. Use the Gudermannian substitution to work out $\int \sec\psi\,d\psi$.


17.5 THE TRACTRIX 


Let us now investigate the involute of the catenary, unwinding from its
"lowest" point A, as in Figure 17.5a [Steinhaus 2, pp. 212-213]. Since the
position vector of the general point P on the catenary is

    $\mathbf{r} = (au, a\cosh u) = a(u, \cosh u),$

where u is given in terms of s by the relation $s = a\sinh u$, the unit tangent
vector is given by

    $a\cosh u\;\mathbf{t} = \frac{ds}{du}\,\frac{d\mathbf{r}}{ds} = \frac{d\mathbf{r}}{du} = a(1, \sinh u),$
    $\mathbf{t} = (\operatorname{sech} u, \tanh u),$

and the position vector of the general point Q on the involute is

    $\mathbf{r} - s\mathbf{t} = a(u, \cosh u) - a\sinh u\,(\operatorname{sech} u, \tanh u)$
    $\qquad\quad = a(u - \tanh u, \operatorname{sech} u).$


Figure 17.5a


Thus the involute, which is known as the tractrix, has the parametric
equations

17.51    $x = a(u - \tanh u), \quad y = a\operatorname{sech} u$

[Lamb 2, p. 325], from which there is no advantage in trying to eliminate u.


Since the unit normal vector at P to the catenary is $(-\tanh u, \operatorname{sech} u)$, the
unit tangent vector at Q to the tractrix is $(\tanh u, -\operatorname{sech} u)$, and the position
vector of the point N at distance a along it is

    $a(u - \tanh u, \operatorname{sech} u) + a(\tanh u, -\operatorname{sech} u) = (au, 0).$


Thus the length of this tangent QN, from its point of contact to its intersec- 
tion with the x-axis, has the constant value a. This is the property that gives 
the tractrix its name: if the (x, y)-plane is horizontal and you walk along the 
x-axis dragging a stone (originally at A) by means of a string of length a, the 
path of the stone is the tractrix. The x-axis is clearly an asymptote. 
Another way of expressing the same property is that the tractrix is an
orthogonal trajectory of a system of congruent circles whose centers lie on a
straight line. E. H. Lockwood* has developed this idea into an approximate
construction for both the tractrix and the catenary.
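
The constant length of the tangent QN can be confirmed numerically from the parametric equations 17.51. The following is a minimal sketch: the function names are hypothetical, and the tangent direction is estimated by a central difference rather than taken from the closed form, so that the check does not assume its own conclusion. The cusp u = 0 is avoided because both derivatives vanish there.

```python
import math

def tractrix(a, u):
    """Point Q of the tractrix 17.51: x = a(u - tanh u), y = a sech u."""
    return a * (u - math.tanh(u)), a / math.cosh(u)

def tangent_segment(a, u, h=1e-5):
    """Return (length of QN, x-coordinate of N), where N is the point in which
    the tangent at Q meets the x-axis. Requires u != 0."""
    x, y = tractrix(a, u)
    x_plus, y_plus = tractrix(a, u + h)
    x_minus, y_minus = tractrix(a, u - h)
    dx, dy = x_plus - x_minus, y_plus - y_minus
    lam = -y / dy                      # Q + lam*(dx, dy) has y-coordinate 0
    n_x = x + lam * dx
    return math.hypot(n_x - x, y), n_x

for u in (0.5, 1.0, 3.0):
    print(u, tangent_segment(2.5, u))  # length close to a = 2.5, N close to (a*u, 0)
```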


EXERCISE 
Compute $\rho$ for the tractrix. What is its value at the "cusp" A, where u = 0?


17.6 TWISTED CURVES 


We saw, in 7.52, that every displacement is a rotation or a translation 
or a twist (that is, the product of a rotation and a translation). As G. Mozzi
remarked in 1763, this description evidently holds not only for a finite dis- 
placement but also for a continuous displacement: in the most general 
motion of a rigid body, there is at each instant a definite screw axis. In the 
case of a pure rotation, or of the motion of a screw in its nut, this axis remains 
invariant; but in general it is continually changing. For instance, the in- 
stantaneous axis of a wheel rolling along a road is not the line of the axle 
(which is moving as fast as the vehicle) but a parallel line on the road. 

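Any rotation may be described by its effect on a variable orthogonal
trihedron of unit vectors which, for reasons that will appear a little later, we
denote by $\mathbf{t}\,\mathbf{p}\,\mathbf{b}$, so that

17.61    $\mathbf{t}^2 = \mathbf{p}^2 = \mathbf{b}^2 = 1, \quad \mathbf{p}\cdot\mathbf{b} = \mathbf{b}\cdot\mathbf{t} = \mathbf{t}\cdot\mathbf{p} = 0,$
         $\mathbf{t} = \mathbf{p}\times\mathbf{b}, \quad \mathbf{p} = \mathbf{b}\times\mathbf{t}, \quad \mathbf{b} = \mathbf{t}\times\mathbf{p}, \quad [\mathbf{t}\,\mathbf{p}\,\mathbf{b}] = 1$

(cf. 17.15). We regard these unit vectors as functions of a parameter s. Since
the derivative of any unit vector is in a perpendicular direction, the
derivative of each of $\mathbf{t}$, $\mathbf{p}$, $\mathbf{b}$ lies in the plane of the other two and is a linear
combination of them. Differentiating the relation $\mathbf{p}\cdot\mathbf{b} = 0$, we see that the
coefficient of $\mathbf{p}$ in the expression for $\dot{\mathbf{b}}$ differs only in sign from the coefficient
of $\mathbf{b}$ in the expression for $\dot{\mathbf{p}}$; similarly for the other pairs of vectors. Hence,
for suitable numbers $\kappa$, $\lambda$, $\tau$ (functions of s), we have

17.62    $\dot{\mathbf{t}} = \kappa\mathbf{p} - \lambda\mathbf{b}, \quad \dot{\mathbf{p}} = \tau\mathbf{b} - \kappa\mathbf{t}, \quad \dot{\mathbf{b}} = \lambda\mathbf{t} - \tau\mathbf{p}.$

These derivatives are conveniently expressed in terms of Darboux's vector

    $\mathbf{d} = \tau\mathbf{t} + \lambda\mathbf{p} + \kappa\mathbf{b}.$

For, we easily verify that

    $\dot{\mathbf{a}} = \mathbf{d}\times\mathbf{a},$

where $\mathbf{a}$ = $\mathbf{t}$ or $\mathbf{p}$ or $\mathbf{b}$ or any other vector rigidly attached to the moving
trihedron [cf. Kreyszig 1, p. 44]. We may even omit the variable vector $\mathbf{a}$
and write "symbolically"

17.63    $\frac{d}{ds} = \mathbf{d}\times.$


* Mathematical Gazette, 43 (1959), pp. 117-118.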
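
The verification that the derivatives 17.62 coincide with the cross products of Darboux's vector with t, p, b can be spelled out numerically. This is a sketch only: the helper names and the particular right-handed orthonormal trihedron are illustrative assumptions, and the three coefficients are arbitrary sample values.

```python
def cross(a, b):
    """Outer (vector) product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def combine(*terms):
    """Linear combination of (coefficient, vector) pairs."""
    return tuple(sum(c * v[i] for c, v in terms) for i in range(3))

def frame_derivatives(kappa, lam, tau, t, p, b):
    """Derivatives of t, p, b computed as d x a with d = tau t + lam p + kappa b."""
    d = combine((tau, t), (lam, p), (kappa, b))
    return cross(d, t), cross(d, p), cross(d, b)

def frame_derivatives_direct(kappa, lam, tau, t, p, b):
    """The same derivatives written out term by term as in 17.62."""
    return (combine((kappa, p), (-lam, b)),
            combine((tau, b), (-kappa, t)),
            combine((lam, t), (-tau, p)))

# A right-handed orthonormal trihedron (an arbitrary illustrative choice).
t = (1 / 3, 2 / 3, 2 / 3)
p = (2 / 3, 1 / 3, -2 / 3)
b = cross(t, p)

print(frame_derivatives(0.7, -0.3, 1.9, t, p, b))
print(frame_derivatives_direct(0.7, -0.3, 1.9, t, p, b))
```

The two printed triples should agree up to rounding, whatever values are chosen for the three coefficients.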

At any point P on a twisted curve, the unit tangent vector $\mathbf{t} = \dot{\mathbf{r}}$ can be
defined in the same way as for a plane curve. But instead of a unique
normal orthogonal to $\mathbf{t}$, we have a normal plane containing a whole flat pencil
of normals. Among the unit normal vectors, we give special names to two:
the principal normal $\mathbf{p}$, in the direction of $\dot{\mathbf{t}}$, and the binormal

    $\mathbf{b} = \mathbf{t}\times\mathbf{p},$

perpendicular to the plane $\mathbf{t}\mathbf{p}$. Since this plane contains the derivative of $\mathbf{t}$
as well as $\mathbf{t}$ itself, its order of contact with the curve is higher than that of
any other plane through $\mathbf{t}$. Because of the more intimate contact, we call
$\mathbf{t}\mathbf{p}$ the osculating plane at the point P. (It contains the directions of
velocity and acceleration of a point moving along the curve [Forder 3, p. 131].)

The formulas 17.62 for the derivatives of $\mathbf{t}$, $\mathbf{p}$, $\mathbf{b}$ are applicable, with the
simplification $\lambda = 0$ due to our choice of $\mathbf{p}$ in the direction of $\dot{\mathbf{t}}$. Thus we
have the Serret-Frenet formulas

         $\dot{\mathbf{t}} = \kappa\mathbf{p},$
17.64    $\dot{\mathbf{p}} = \tau\mathbf{b} - \kappa\mathbf{t},$
         $\dot{\mathbf{b}} = -\tau\mathbf{p},$

which may be epitomized in the form 17.63 with

17.65    $\mathbf{d} = \kappa\mathbf{b} + \tau\mathbf{t}.$


The coefficients $\kappa$ and $\tau$ are called the curvature and torsion of the curve (at P).
When $\kappa$ is constantly zero, $\mathbf{t}$ never changes and the "curve" is a straight
line. As the name "curvature" suggests, $\kappa$ measures the rate at which any
nonstraight curve tends to depart from its tangent. Like a plane curve, a
twisted curve has a circle of curvature of radius $1/\kappa$, which lies in the
osculating plane and has its center on the principal normal; that is, the position
vector of its center is $\mathbf{r} + \rho\mathbf{p}$, where $\rho = 1/\kappa$ is the radius of curvature.

When $\tau$ is constantly zero, the osculating plane never changes, and we
have a plane curve, with $\mathbf{n} = \mathbf{p}$. The torsion (so named by L. I. Vallée in
1825) measures the rate at which a twisted curve tends to depart from its
osculating plane.

The formulas 17.64 were first given by Serret (1851) and Frenet (1852)
without the vector notation, that is, as formulas for the derivatives of the
direction cosines of the tangent, principal normal, and binormal. Combining
them with

    $\dot{\mathbf{r}} = \mathbf{t},$

we obtain

    $\ddot{\mathbf{r}} = \kappa\mathbf{p}, \quad \dddot{\mathbf{r}} = \dot{\kappa}\mathbf{p} + \kappa(\tau\mathbf{b} - \kappa\mathbf{t}),$

whence

    $[\dot{\mathbf{r}}\,\ddot{\mathbf{r}}\,\dddot{\mathbf{r}}] = \kappa^2\tau, \quad \ddot{\mathbf{r}}^2 = \kappa^2.$


EXERCISES
1. For a curve drawn on a sphere, the center of the circle of curvature at any point
is the foot of the perpendicular from the center of the sphere upon the osculating plane
at the point.
2. The tangent to the locus of the center of the circle of curvature of any curve is
perpendicular to the tangent at the corresponding point on the original curve.

3. For any twisted curve,

    $[\dot{\mathbf{t}}\,\ddot{\mathbf{t}}\,\dddot{\mathbf{t}}] = \kappa^3(\kappa\dot{\tau} - \tau\dot{\kappa}) = \kappa^5\,\frac{d}{ds}\left(\frac{\tau}{\kappa}\right),$

    $[\dot{\mathbf{b}}\,\ddot{\mathbf{b}}\,\dddot{\mathbf{b}}] = \tau^3(\tau\dot{\kappa} - \kappa\dot{\tau}) = \tau^5\,\frac{d}{ds}\left(\frac{\kappa}{\tau}\right).$


17.7 THE CIRCULAR HELIX


As we saw in § 8.7, the locus of a point moving in a plane under the
action of a continuous dilative rotation is an equiangular spiral. Analogously,
the locus of a point moving in space under the action of a continuous twist
is a circular helix (§ 11.5). In terms of cylindrical coordinates $(r, \theta, z)$, defined
by

    $x = r\cos\theta, \quad y = r\sin\theta, \quad z$ as usual,

a twist along and around the z-axis is

    $(r, \theta, z) \to (r, \theta + u, z + uc)$:

the product of the rotation $\theta \to \theta + u$ and the translation $z \to z + uc$.
Applying this twist to the point (a, 0, 0), we obtain (a, u, uc). Thus the circular
helix has the parametric equations

    $r = a, \quad \theta = u, \quad z = uc,$

or

    $x = a\cos u, \quad y = a\sin u, \quad z = cu$

[Weatherburn 2, p. 16]. In other words, the helix, which is the shape of
the rail of a "spiral" staircase, has the equations

    $r = a, \quad z = c\theta$

or

    $x^2 + y^2 = a^2, \quad \frac{y}{x} = \tan\frac{z}{c},$

which express it as the curve of intersection of two surfaces: the circular
cylinder

    $r = a$, or $x^2 + y^2 = a^2,$

and the helicoid

    $z = c\theta$, or $\frac{y}{x} = \tan\frac{z}{c},$

which is the shape of the ceiling of the staircase (or of a propellor blade)
[Steinhaus 2, p. 196].
Differentiating

    $\mathbf{r} = (a\cos u, a\sin u, cu)$

with respect to s, we obtain

    $\mathbf{t} = \dot{u}(-a\sin u, a\cos u, c).$

Since this must be a unit vector, we have

    $\dot{u} = 1/\sqrt{a^2 + c^2},$

and we shall find it convenient to retain the symbol $\dot{u}$ as a temporary
abbreviation for this constant. The Serret-Frenet formulas yield

    $\kappa\mathbf{p} = \dot{\mathbf{t}} = \dot{u}^2(-a\cos u, -a\sin u, 0) = -\dot{u}^2 a\,(\cos u, \sin u, 0),$

    $\kappa = \dot{u}^2 a = \frac{a}{a^2 + c^2},$

    $\mathbf{p} = -(\cos u, \sin u, 0)$  (perpendicular to the z-axis),

    $\mathbf{b} = \mathbf{t}\times\mathbf{p} = \dot{u}(c\sin u, -c\cos u, a),$

    $-\tau\mathbf{p} = \dot{\mathbf{b}} = \dot{u}^2 c\,(\cos u, \sin u, 0),$

    $\tau = \dot{u}^2 c = \frac{c}{a^2 + c^2}.$

Thus both $\kappa$ and $\tau$ are constant, a result which could have been seen from
first principles without any appeal to calculus, since the twist that slides
the circular helix along itself transforms the curvature and torsion at one
point into the same properties at another point. Conversely, since every
displacement is a twist, every curve whose curvature and torsion are constant
is a circular helix if we include, as limiting cases, the straight line ($\kappa = 0$,
$a = 0$) and the circle ($\tau = 0$, $c = 0$).
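
The constancy of the curvature and torsion can also be checked from the relations of § 17.6, namely that the second derivative of r with respect to s has length κ and that the triple product of the first three derivatives equals κ²τ. The sketch below assumes the arc-length parametrization u = s/√(a² + c²) and writes out the three derivatives by hand; the function names are illustrative.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def helix_derivatives(a, c, s):
    """First three derivatives, with respect to the arc s, of
    r = (a cos u, a sin u, c u) with u = s / sqrt(a^2 + c^2)."""
    m = 1.0 / math.sqrt(a * a + c * c)           # du/ds
    u = m * s
    r1 = (-a * m * math.sin(u), a * m * math.cos(u), c * m)
    r2 = (-a * m ** 2 * math.cos(u), -a * m ** 2 * math.sin(u), 0.0)
    r3 = (a * m ** 3 * math.sin(u), -a * m ** 3 * math.cos(u), 0.0)
    return r1, r2, r3

def curvature_and_torsion(a, c, s):
    r1, r2, r3 = helix_derivatives(a, c, s)
    kappa = math.sqrt(dot(r2, r2))               # second derivative = kappa p
    tau = dot(r1, cross(r2, r3)) / kappa ** 2    # triple product = kappa^2 tau
    return kappa, tau

for s in (0.0, 1.0, 7.5):
    print(curvature_and_torsion(3.0, 4.0, s))    # close to (3/25, 4/25) each time
```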

When $\kappa$ and $\tau$ are constant, Darboux's vector 17.65, being rigidly
attached to the moving trihedron, is one of the vectors to which 17.63 is
applicable. Thus

    $\dot{\mathbf{d}} = \mathbf{d}\times\mathbf{d} = \mathbf{0},$

and $\mathbf{d}$ is constant. In fact, just as the motion of the tangent at a point
describing a plane curve is, at each instant, a rotation about the center of
curvature, so the motion of the $\mathbf{t}\,\mathbf{p}\,\mathbf{b}$ trihedron at a point describing a twisted curve
is, at each instant, a twist about a certain line (screw axis) in the
direction of Darboux's vector. In the case of the plane curve, the center of
curvature appeared as the center of the circle of "closest fit": having the same
tangent and curvature as the given curve. Analogously for the twisted curve,
the screw axis can be obtained as the axis of the circular helix of closest fit:
having the same $\mathbf{t}\,\mathbf{p}\,\mathbf{b}$ and the same curvature and torsion. Thus the screw
axis is the line in the direction $\kappa\mathbf{b} + \tau\mathbf{t}$ through the point whose position
vector is

    $\mathbf{r} + a\mathbf{p},$

where a, being the radius of the circular cylinder containing the helix, is
obtained by eliminating c from the equations

    $\kappa = \frac{a}{a^2 + c^2}, \quad \tau = \frac{c}{a^2 + c^2};$

in fact,

    $a = \frac{\kappa}{\kappa^2 + \tau^2}.$

In the case of a plane curve we have $\tau = 0$, $a = \rho$, the position vector
$\mathbf{r} + a\mathbf{p}$ becomes $\mathbf{r} + \rho\mathbf{n}$, and Darboux's vector becomes $\kappa\mathbf{b}$: perpendicular
to the plane of the curve.
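
The elimination of c can be illustrated with numbers. In the sketch below (hypothetical function names), the radius a is recovered from κ and τ by the formula just given; the companion expression c = τ/(κ² + τ²) is not stated in the text but follows in the same way from the two equations. For the circular helix of § 17.7 the point r + ap is then seen to lie on the z-axis, which is the screw axis.

```python
import math

def helix_from_invariants(kappa, tau):
    """Radius a and constant c of the circular helix with the given
    curvature and torsion: a = kappa/(kappa^2 + tau^2), c = tau/(kappa^2 + tau^2)."""
    q = kappa * kappa + tau * tau
    return kappa / q, tau / q

def axis_point(a, c, u):
    """r + a p for the helix r = (a cos u, a sin u, c u), with p = -(cos u, sin u, 0)."""
    r = (a * math.cos(u), a * math.sin(u), c * u)
    p = (-math.cos(u), -math.sin(u), 0.0)
    return tuple(ri + a * pi for ri, pi in zip(r, p))

a, c = helix_from_invariants(0.3, 0.4)
print(a, c)                       # close to 1.2 and 1.6
print(axis_point(a, c, 2.0))      # x and y vanish up to rounding
```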


EXERCISES 


1. The orthogonal projection of the circular helix on a plane through its axis, such
as the plane x = 0, is the sine curve

    $y = a\sin\frac{z}{c}.$


2. Describe the surface formed by the midpoints of all the chords of a circular helix.

3. The locus of centers of circles of curvature of a circular helix H is another circular
helix H', and the locus of centers of circles of curvature of H' is H itself. For what
value of c/a (or $\tau/\kappa$) will H and H' be congruent? (It is, of course, sufficient to
consider a single point on H, such as the point where u = 0.)


17.8 THE GENERAL HELIX 


We have seen that the circular helix is characterized by its property of 
having constant curvature and constant torsion. It is a special case of the 
general helix, which may be defined either as a curve whose curvature and 
torsion are in a constant ratio or as a curve whose tangent makes a con- 
stant angle with a fixed vector. We proceed to prove the equivalence of 
these two definitions. 


Suppose first that the curvature and torsion are in a constant ratio (i.e.,
a ratio independent of s), say

    $\tau = c\kappa.$

Then

    $\dot{\mathbf{t}} = \kappa\mathbf{p}, \quad \dot{\mathbf{b}} = -\tau\mathbf{p} = -c\kappa\mathbf{p},$
    $c\dot{\mathbf{t}} + \dot{\mathbf{b}} = \mathbf{0}.$

Since this is the derivative of $c\mathbf{t} + \mathbf{b}$, the latter is a fixed vector, say $\mathbf{a}$, which
makes a constant angle with $\mathbf{t}$ since

    $\mathbf{a}\cdot\mathbf{t} = (c\mathbf{t} + \mathbf{b})\cdot\mathbf{t} = c.$

Conversely, suppose $\mathbf{t}$ makes a constant angle $\beta$ with a fixed unit vector $\mathbf{k}$.
Differentiating the equation

    $\mathbf{t}\cdot\mathbf{k} = \cos\beta,$

we obtain

    $\kappa\,\mathbf{p}\cdot\mathbf{k} = 0.$

Assuming that $\kappa \ne 0$, we have $\mathbf{p}\cdot\mathbf{k} = 0$, so that the constant vector $\mathbf{k}$ lies in
the $\mathbf{b}\mathbf{t}$ plane and makes complementary angles with $\mathbf{b}$ and $\mathbf{t}$. Since $\mathbf{t}\cdot\mathbf{k} = \cos\beta$,
we have also

    $\mathbf{b}\cdot\mathbf{k} = \sin\beta.$

Differentiating the equation $\mathbf{p}\cdot\mathbf{k} = 0$, we obtain

    $(\tau\mathbf{b} - \kappa\mathbf{t})\cdot\mathbf{k} = 0,$
    $\tau\sin\beta - \kappa\cos\beta = 0,$

    $\frac{\kappa}{\tau} = \tan\beta.$

Lines in the constant direction $\mathbf{k}$ through all the points of the curve
generate a (general) cylinder. Thus the helix can alternatively be described as a
curve drawn on a cylinder in such a way as to cut the generators at a
constant angle. In other words, it can be obtained by drawing a straight line
obliquely on a sheet of paper and then wrapping the paper on the cylinder.
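
Both definitions can be tried out on the circular helix of § 17.7, where t and b are known explicitly. The following sketch (illustrative names; the symbol `ratio` stands for the constant called c in the argument above, which for this helix equals the quotient of the helix constants c and a) checks that ratio·t + b does not vary along the curve, and that the angle β between t and the z-axis satisfies tan β = κ/τ.

```python
import math

def helix_frame(a, c, u):
    """Tangent t and binormal b of the helix r = (a cos u, a sin u, c u)."""
    m = 1.0 / math.sqrt(a * a + c * c)
    t = (-a * m * math.sin(u), a * m * math.cos(u), c * m)
    b = (c * m * math.sin(u), -c * m * math.cos(u), a * m)
    return t, b

def fixed_vector(a, c, u):
    """The combination ratio*t + b, with ratio = tau/kappa = c/a."""
    ratio = c / a
    t, b = helix_frame(a, c, u)
    return tuple(ratio * ti + bi for ti, bi in zip(t, b))

def tangent_angle_with_axis(a, c, u):
    """Angle beta between t and the unit vector k = (0, 0, 1)."""
    t, _ = helix_frame(a, c, u)
    return math.acos(t[2])

for u in (0.0, 1.0, 2.5):
    print(fixed_vector(3.0, 4.0, u), math.tan(tangent_angle_with_axis(3.0, 4.0, u)))
```

For a = 3, c = 4 the printed vector should stay fixed along the axis direction and the tangent of the angle should equal a/c = 0.75, which is κ/τ for this helix.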


EXERCISES 
1. Using Darboux's operator 17.63 to differentiate the constant vector $\mathbf{k}$, obtain

    $\mathbf{d}\times\mathbf{k} = \mathbf{0}.$

Deduce that Darboux's vector $\mathbf{d} = \kappa\mathbf{b} + \tau\mathbf{t}$ is parallel to $\mathbf{k}$: its direction is constant
(though its length, $\sqrt{\kappa^2 + \tau^2}$, may vary).
2. Find $\kappa$ and $\tau$ for the curve

    $x = 3u - u^3, \quad y = 3u^2, \quad z = 3u + u^3,$

and deduce that this curve is a helix.


17.9 THE CONCHO-SPIRAL 


The spirals described on shells, and called concho-spirals, are such as
would result from winding plane logarithmic spirals on cones.

Henry Moseley (1801-1872)

[Moseley 1, p. 301]


The two most interesting helices are as follows: (1) the circular helix, 
which is the locus of a point under the action of a continuous twist, so that 
its curvature and torsion are constant; and (2) the concho-spiral, which is 
the locus of a point under the action of a continuous dilative rotation, so 
that its curvature and torsion are both inversely proportional to its arc s, 
measured from the apex O of the cone on which it evidently lies (cutting
the generators at a constant angle). A considerable arc of this curve can
be seen on the shell Turritella duplicata [Weyl 1, p. 68]. Architectural
applications appear on spires in Copenhagen, notably that of the Stock
Exchange Building, where the tails of four dragons are twisted together.

In terms of cylindrical coordinates, a dilative rotation round the z-axis, say

    $(r, \theta, z) \to (\mu^u r, \theta + u, \mu^u z),$

applied to the point (a, 0, c), yields the concho-spiral

    $r = \mu^u a, \quad \theta = u, \quad z = \mu^u c.$

To see how the circular helix can arise as a limiting form of the
concho-spiral, we change the origin by writing z + c for z, and then make c tend to
infinity and $\mu$ to 1 in such a manner that $(\mu - 1)c$ approaches a finite number
b. Instead of $r = \mu^u a$ and $z = \mu^u c$, we have $r = a$ and

    $z = (\mu^u - 1)c = \frac{\mu^u - 1}{\mu - 1}\,(\mu - 1)c \to ub.$

Thus the limiting form is the circular helix

    $r = a, \quad \theta = u, \quad z = ub.$
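
The limiting process can be watched numerically. In this sketch (hypothetical names), μ is tied to c by the particular choice μ = 1 + b/c, which is an assumption made only to keep (μ − 1)c exactly equal to b; the shifted height (μ^u − 1)c is then evaluated for increasing c and compared with ub.

```python
def shifted_height(mu, c, u):
    """z-coordinate of the concho-spiral after moving the origin: (mu**u - 1) * c."""
    return (mu ** u - 1.0) * c

def heights_along_limit(b, u, cs=(10.0, 1e3, 1e5)):
    """Heights for growing c, with mu = 1 + b/c so that (mu - 1) * c = b."""
    return [shifted_height(1.0 + b / c, c, u) for c in cs]

b, u = 0.5, 3.0
print(heights_along_limit(b, u), u * b)   # the list approaches u*b = 1.5
```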
EXERCISES 
1. Express the parametric equations for the concho-spiral in terms of Cartesian
coordinates.

2. Verify from these equations that the tangent $\mathbf{t}$ to the concho-spiral makes a
constant angle with the z-axis.

3. Obtain a formula for the angle at which the concho-spiral cuts the generators of
the cone

    $\frac{r}{a} = \frac{z}{c}.$

4. A familiar model for a cone of revolution is obtained by cutting out a circular
sector from a sheet of paper and rolling it up so that the center of the circle becomes the
vertex of the cone. The angle $\alpha$ of the sector and the semivertical angle $\beta$ of the cone
are connected by the formula

    $\alpha = 2\pi\sin\beta;$

for example, the sector is a semicircle if $\beta = \pi/6$. If $\sin\beta = 1/n$, where n is an
integer greater than 1, the unfolded form of any concho-spiral on the cone consists of a
sequence of arcs belonging to n equiangular spirals.


5. Like any other helix, the concho-spiral lies on a cylinder and cuts the generators
at a constant angle. What kind of cylinder is this in the present case?


The tensor notation


In this interlude between the differential geometry of curves (Chapter 17) 
and the differential geometry of surfaces (Chapter 19) we introduce Ricci’s 
famous notation, which is both suggestive and economical. (Without its aid 
the general theory of relativity could hardly have been formulated.) One of 
its simplest applications has no direct connection with differential geometry: 
“reciprocal lattices” are used in both x-ray crystallography (§ 18.3) and the 
geometry of numbers (§ 18.4).


18.1 DUAL BASES 


The tensor method ... has the great advantage that it is not a new
notation, but a concise way of writing the ordinary notation.

Harold Jeffreys (1891 - )

[Jeffreys 1, Preface]


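As a basis for our vector space (or a frame for affine coordinates), instead
of $\mathbf{e}$, $\mathbf{f}$, $\mathbf{g}$ (as in 17.11) or $\mathbf{i}$, $\mathbf{j}$, $\mathbf{k}$ (as in 17.14), it is more systematic to write
$\mathbf{r}_1$, $\mathbf{r}_2$, $\mathbf{r}_3$. Along with this set of three independent vectors we use also the
dual basis $\mathbf{r}^1$, $\mathbf{r}^2$, $\mathbf{r}^3$, defined in terms of $\mathbf{r}_1$, $\mathbf{r}_2$, $\mathbf{r}_3$ by the equation

18.11    $\mathbf{r}^\alpha\cdot\mathbf{r}_\beta = \delta^\alpha_\beta,$

where the Kronecker delta $\delta^\alpha_\beta$ is a useful symbol which means 1 or 0 according
as $\alpha$ and $\beta$ are equal or unequal. (The "1, 2, 3" of the dual basis are not
exponents but "upper indices" or "superscripts," analogous to the subscripts
used in the original basis.) Thus $\mathbf{r}^1$ is perpendicular to the plane $\mathbf{r}_2\mathbf{r}_3$ and its
length is adjusted so that $\mathbf{r}^1\cdot\mathbf{r}_1 = 1$; similarly for $\mathbf{r}^2$ and $\mathbf{r}^3$. Each $\mathbf{r}^\alpha$, being
perpendicular to two $\mathbf{r}_\beta$'s, may be expressed as an outer product:

18.12    $\mathbf{r}^1 = \frac{\mathbf{r}_2\times\mathbf{r}_3}{J}, \quad \mathbf{r}^2 = \frac{\mathbf{r}_3\times\mathbf{r}_1}{J}, \quad \mathbf{r}^3 = \frac{\mathbf{r}_1\times\mathbf{r}_2}{J},$

where, since $\mathbf{r}^\alpha\cdot\mathbf{r}_\alpha = 1$,

18.13    $J = [\mathbf{r}_1\,\mathbf{r}_2\,\mathbf{r}_3].$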




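Since the basic vectors $\mathbf{r}_\alpha$ are independent, $J \ne 0$. By 17.17, we have

    $[\mathbf{r}^1\,\mathbf{r}^2\,\mathbf{r}^3]\,[\mathbf{r}_1\,\mathbf{r}_2\,\mathbf{r}_3] = \begin{vmatrix} \mathbf{r}^1\cdot\mathbf{r}_1 & \mathbf{r}^1\cdot\mathbf{r}_2 & \mathbf{r}^1\cdot\mathbf{r}_3 \\ \mathbf{r}^2\cdot\mathbf{r}_1 & \mathbf{r}^2\cdot\mathbf{r}_2 & \mathbf{r}^2\cdot\mathbf{r}_3 \\ \mathbf{r}^3\cdot\mathbf{r}_1 & \mathbf{r}^3\cdot\mathbf{r}_2 & \mathbf{r}^3\cdot\mathbf{r}_3 \end{vmatrix} = \begin{vmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{vmatrix} = 1,$

so that

18.14    $[\mathbf{r}^1\,\mathbf{r}^2\,\mathbf{r}^3] = J^{-1}$

and the dual basic vectors are likewise independent. Interchanging "uppers
and lowers" in 18.12, we obtain

18.15    $\mathbf{r}_1 = J\,\mathbf{r}^2\times\mathbf{r}^3, \quad \mathbf{r}_2 = J\,\mathbf{r}^3\times\mathbf{r}^1, \quad \mathbf{r}_3 = J\,\mathbf{r}^1\times\mathbf{r}^2.$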
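
As an illustration of 18.11 to 18.15, the dual basis of a concrete basis can be computed from the outer products in 18.12. The sketch below is not from the text: the function names and the sample basis (1, 0, 0), (1, 2, 0), (1, 1, 3) are arbitrary choices, and Cartesian components are assumed for the inner and outer products.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def triple(a, b, c):
    """The triple product [a b c]."""
    return dot(a, cross(b, c))

def dual_basis(r1, r2, r3):
    """Dual basis by 18.12: each upper vector is an outer product divided by J."""
    J = triple(r1, r2, r3)
    if J == 0:
        raise ValueError("the three vectors are not independent")
    return (tuple(x / J for x in cross(r2, r3)),
            tuple(x / J for x in cross(r3, r1)),
            tuple(x / J for x in cross(r1, r2)))

lower = ((1, 0, 0), (1, 2, 0), (1, 1, 3))
upper = dual_basis(*lower)
print([[dot(a, b) for b in lower] for a in upper])   # Kronecker delta, up to rounding
print(triple(*lower), triple(*upper))                # J and its reciprocal
```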


EXERCISE 
Deduce 18.14 from 18.12 without using 17.17. (Hint: Apply 17.13 to $\mathbf{r}_2$, $\mathbf{r}_3$, $\mathbf{r}^2$, $\mathbf{r}^3$.)


18.2 THE FUNDAMENTAL TENSOR 


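Any vector $\mathbf{u}$ may be expressed as $\sum u_\alpha\mathbf{r}^\alpha$ (meaning $u_1\mathbf{r}^1 + u_2\mathbf{r}^2 + u_3\mathbf{r}^3$)
or as $\sum u^\alpha\mathbf{r}_\alpha$. The covariant* components $u_\alpha$ and contravariant components
$u^\alpha$ are simply the inner products $\mathbf{u}\cdot\mathbf{r}_\alpha$ and $\mathbf{u}\cdot\mathbf{r}^\alpha$; for

    $\mathbf{u}\cdot\mathbf{r}_\beta = \sum u_\alpha\,\mathbf{r}^\alpha\cdot\mathbf{r}_\beta = \sum u_\alpha\,\delta^\alpha_\beta = u_\beta$

and

    $\mathbf{u}\cdot\mathbf{r}^\beta = \sum u^\alpha\,\mathbf{r}_\alpha\cdot\mathbf{r}^\beta = \sum u^\alpha\,\delta^\beta_\alpha = u^\beta.$

(In such expressions as $\sum u_\alpha\,\mathbf{r}^\alpha\cdot\mathbf{r}_\beta$, it is understood that the summation is
taken over the index $\alpha$ that appears twice, once "up" and once "down," and
not over the index $\beta$ which only appears once. The sum $\sum\delta^\alpha_\beta u_\alpha$ involves
three values of $\alpha$, one of which must be equal to $\beta$, so the "incomplete
symbol" $\sum\delta^\alpha_\beta$ serves as a substitution operator transforming $u_\alpha$ into $u_\beta$.)

Thus

18.21    $\mathbf{u} = \sum(\mathbf{u}\cdot\mathbf{r}_\alpha)\,\mathbf{r}^\alpha = \sum(\mathbf{u}\cdot\mathbf{r}^\alpha)\,\mathbf{r}_\alpha.$

In particular, this holds when $\mathbf{u} = \mathbf{r}_\beta$ or $\mathbf{r}^\beta$, in which cases the components
$u_\alpha = \mathbf{u}\cdot\mathbf{r}_\alpha$ and $u^\alpha = \mathbf{u}\cdot\mathbf{r}^\alpha$ are denoted by

18.22    $g_{\alpha\beta} = \mathbf{r}_\alpha\cdot\mathbf{r}_\beta, \quad g^{\alpha\beta} = \mathbf{r}^\alpha\cdot\mathbf{r}^\beta,$

so that

18.221   $\mathbf{r}_\alpha = \sum g_{\alpha\beta}\,\mathbf{r}^\beta, \quad \mathbf{r}^\alpha = \sum g^{\alpha\beta}\,\mathbf{r}_\beta.$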


*For the whole story of the subtleties that underlie the terms covariant and contravariant, see
Kreyszig [1, p. 88]. The present treatment was suggested by G. Hessenberg, Vektorielle
Begründung der Differentialgeometrie, Mathematische Annalen, 78 (1917), 187-217.

(In the expression $\sum g_{\alpha\beta}\,\mathbf{r}^\beta$, it is $\beta$ that appears twice, so we sum over $\beta$,
obtaining $g_{\alpha 1}\mathbf{r}^1 + g_{\alpha 2}\mathbf{r}^2 + g_{\alpha 3}\mathbf{r}^3$.) The commutativity of inner products shows
that

    $g_{\alpha\beta} = g_{\beta\alpha}, \quad g^{\alpha\beta} = g^{\beta\alpha}.$

The connection between the covariant tensor $g_{\alpha\beta}$ and the contravariant
tensor $g^{\alpha\beta}$ is found as follows:

18.23    $\sum g_{\alpha\gamma}\,g^{\alpha\beta} = \sum g_{\gamma\alpha}\,\mathbf{r}^\alpha\cdot\mathbf{r}^\beta = \mathbf{r}_\gamma\cdot\mathbf{r}^\beta = \delta^\beta_\gamma.$

Thus the two symmetric matrices $\|g_{\alpha\beta}\|$ and $\|g^{\alpha\beta}\|$ have as their product the
unit matrix. The two corresponding determinants cannot vanish, since their
product is 1. When the coefficients $g_{\alpha\beta}$ are given, we have, in 18.23 for each
value of $\beta$, a set of three linear equations

    $g_{1\gamma}\,g^{1\beta} + g_{2\gamma}\,g^{2\beta} + g_{3\gamma}\,g^{3\beta} = \delta^\beta_\gamma \quad (\gamma = 1, 2, 3)$

to be solved for the three unknowns $g^{1\beta}$, $g^{2\beta}$, $g^{3\beta}$. By Cramer's rule [Birkhoff
and MacLane 1, p. 286], the solution is

    $g^{\alpha\beta} = (\text{cofactor of } g_{\alpha\beta} \text{ in } G)/G, \quad G = \det(g_{\alpha\beta}).$

In particular, if $g_{23} = g_{31} = g_{12} = 0$, we have

    $g^{\alpha\alpha} = 1/g_{\alpha\alpha}$

and $g^{23} = g^{31} = g^{12} = 0$.
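
The solution by cofactors is easy to carry out for a concrete basis. The following sketch makes several assumptions for illustration: the function names, the use of exact rational arithmetic, and the sample basis (1, 0, 0), (1, 2, 0), (1, 1, 3). It forms the covariant tensor from inner products, obtains the contravariant tensor as cofactors divided by G, and confirms that the two matrices multiply to the unit matrix.

```python
from fractions import Fraction

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def fundamental_tensor(basis):
    """Covariant components: inner products of the basic vectors in pairs."""
    return [[dot(a, b) for b in basis] for a in basis]

def det3(m):
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def contravariant_tensor(g):
    """Cofactor of each entry of g, divided by G = det(g)."""
    G = Fraction(det3(g))
    def cofactor(i, j):
        # cyclic index form of the signed 2x2 minor of a 3x3 matrix
        i1, i2, j1, j2 = (i + 1) % 3, (i + 2) % 3, (j + 1) % 3, (j + 2) % 3
        return g[i1][j1] * g[i2][j2] - g[i1][j2] * g[i2][j1]
    return [[cofactor(i, j) / G for j in range(3)] for i in range(3)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

g = fundamental_tensor([(1, 0, 0), (1, 2, 0), (1, 1, 3)])
g_up = contravariant_tensor(g)
print(g, det3(g))
print(matmul(g, g_up))
```

Because the matrix is symmetric, the matrix of cofactors needs no transposition before the division by G.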


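To express either kind of components of a vector $\mathbf{u}$ in terms of the other
kind, we have

18.24    $u_\alpha = \mathbf{u}\cdot\mathbf{r}_\alpha = \mathbf{u}\cdot\sum g_{\alpha\beta}\,\mathbf{r}^\beta = \sum g_{\alpha\beta}\,u^\beta,$

and similarly, $u^\alpha = \sum g^{\alpha\beta}\,u_\beta$. The inner product of two vectors

    $\mathbf{u} = \sum u^\alpha\mathbf{r}_\alpha = \sum u_\alpha\mathbf{r}^\alpha$  and  $\mathbf{v} = \sum v^\alpha\mathbf{r}_\alpha = \sum v_\alpha\mathbf{r}^\alpha$

may be expressed in various ways as a bilinear form:

    $\mathbf{u}\cdot\mathbf{v} = \sum u^\alpha\,\mathbf{r}_\alpha\cdot\mathbf{v} = \sum u^\alpha v_\alpha,$
    $\mathbf{u}\cdot\mathbf{v} = \sum u_\alpha\,\mathbf{r}^\alpha\cdot\mathbf{v} = \sum u_\alpha v^\alpha,$
    $\mathbf{u}\cdot\mathbf{v} = \sum u^\alpha\mathbf{r}_\alpha\cdot\sum v^\beta\mathbf{r}_\beta = \sum\sum g_{\alpha\beta}\,u^\alpha v^\beta,$
    $\mathbf{u}\cdot\mathbf{v} = \sum u_\alpha\mathbf{r}^\alpha\cdot\sum v_\beta\mathbf{r}^\beta = \sum\sum g^{\alpha\beta}\,u_\alpha v_\beta.$

In particular, the length $|\mathbf{u}|$ is given by

18.25    $|\mathbf{u}|^2 = \mathbf{u}\cdot\mathbf{u} = \sum u_\alpha u^\alpha = \sum\sum g_{\alpha\beta}\,u^\alpha u^\beta = \sum\sum g^{\alpha\beta}\,u_\alpha u_\beta.$

Let us regard $\mathbf{u}$ and $\mathbf{v}$ as the position vectors of points having
contravariant coordinates $(u^1, u^2, u^3)$ and covariant coordinates $(v_1, v_2, v_3)$,
respectively. If $\mathbf{u}$ is fixed while $\mathbf{v}$ varies, the two equations

18.26    $\mathbf{u}\cdot\mathbf{v} = 0, \quad \mathbf{u}\cdot\mathbf{v} = 1$

define respectively the plane through the origin perpendicular to $\mathbf{u}$, and the
parallel plane whose distance from the origin, being the length of the
projection of $\mathbf{v}$ on the unit vector $\mathbf{u}/|\mathbf{u}|$, is

    $\frac{\mathbf{u}\cdot\mathbf{v}}{|\mathbf{u}|} = \frac{1}{|\mathbf{u}|}.$

The latter plane, passing through the inverse of $(u^1, u^2, u^3)$ in the unit sphere

    $\mathbf{v}\cdot\mathbf{v} = 1,$

is the polar plane of $(u^1, u^2, u^3)$ with respect to the sphere. (See § 8.8.)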
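
The four bilinear forms for the inner product, together with the lowering of indices in 18.24, can be compared on a small example. Everything specific here is an assumption made for illustration: the function names, the basis (1, 0, 0), (1, 2, 0), (1, 1, 3), and the contravariant components (1, 2, -1) and (3, 0, 2). Exact rational arithmetic is used so that the four values can be compared without rounding.

```python
from fractions import Fraction

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dual_basis(basis):
    """Dual basis from outer products divided by J = [r1 r2 r3]."""
    r1, r2, r3 = basis
    J = Fraction(dot(r1, cross(r2, r3)))
    return [tuple(x / J for x in cross(r2, r3)),
            tuple(x / J for x in cross(r3, r1)),
            tuple(x / J for x in cross(r1, r2))]

def gram(vectors):
    return [[dot(a, b) for b in vectors] for a in vectors]

def lower_indices(g, up):
    """Covariant components from contravariant ones, as in 18.24."""
    return [sum(g[i][j] * up[j] for j in range(3)) for i in range(3)]

def inner_products(basis, u_up, v_up):
    """The four bilinear forms for u.v, in the order given in the text."""
    g, g_up = gram(basis), gram(dual_basis(basis))
    u_dn, v_dn = lower_indices(g, u_up), lower_indices(g, v_up)
    idx = range(3)
    return (sum(u_up[i] * v_dn[i] for i in idx),
            sum(u_dn[i] * v_up[i] for i in idx),
            sum(g[i][j] * u_up[i] * v_up[j] for i in idx for j in idx),
            sum(g_up[i][j] * u_dn[i] * v_dn[j] for i in idx for j in idx))

basis = [(1, 0, 0), (1, 2, 0), (1, 1, 3)]
print(inner_products(basis, (1, 2, -1), (3, 0, 2)))
```

For these sample values the Cartesian vectors are (2, 3, -3) and (5, 2, 6), so all four forms should reproduce their ordinary inner product.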


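It is sometimes convenient to express the basic vectors in terms of
Cartesian coordinates:

    $\mathbf{r}_\alpha = x_\alpha\mathbf{i} + y_\alpha\mathbf{j} + z_\alpha\mathbf{k}.$

Then, by 18.22 and 17.151,

18.27    $g_{\alpha\beta} = \mathbf{r}_\alpha\cdot\mathbf{r}_\beta = x_\alpha x_\beta + y_\alpha y_\beta + z_\alpha z_\beta,$

18.28    $J = [\mathbf{r}_1\,\mathbf{r}_2\,\mathbf{r}_3] = \begin{vmatrix} x_1 & y_1 & z_1 \\ x_2 & y_2 & z_2 \\ x_3 & y_3 & z_3 \end{vmatrix},$

and the determinant of the fundamental tensor is

18.29    $G = \begin{vmatrix} g_{11} & g_{12} & g_{13} \\ g_{21} & g_{22} & g_{23} \\ g_{31} & g_{32} & g_{33} \end{vmatrix} = \begin{vmatrix} x_1 & y_1 & z_1 \\ x_2 & y_2 & z_2 \\ x_3 & y_3 & z_3 \end{vmatrix}\,\begin{vmatrix} x_1 & x_2 & x_3 \\ y_1 & y_2 & y_3 \\ z_1 & z_2 & z_3 \end{vmatrix} = J^2.$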
EXERCISES 


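1. $\mathbf{u}\cdot\mathbf{v} = u^1 v_1 + u^2 v_2 + u^3 v_3$.
2. $|\mathbf{u}|^2 = g_{11}(u^1)^2 + g_{22}(u^2)^2 + g_{33}(u^3)^2 + 2g_{23}\,u^2u^3 + 2g_{31}\,u^3u^1 + 2g_{12}\,u^1u^2$.
Give the corresponding expression for $\mathbf{u}\cdot\mathbf{v}$.
3. Use 18.12, 18.221 and 18.15 to prove 17.12 in the form

    $(\mathbf{r}_1\times\mathbf{r}_2)\times\mathbf{r}_3 = g_{13}\,\mathbf{r}_2 - g_{23}\,\mathbf{r}_1.$

4. $\sum\mathbf{r}^\alpha\times\mathbf{r}_\alpha = \mathbf{0}$.
5. Express $\det(g^{\alpha\beta})$ in terms of $G = \det(g_{\alpha\beta})$.


18.3 RECIPROCAL LATTICES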


It can fairly be said that the reciprocal lattice provides one of the most
important tools available in the study of the diffraction of x-rays by
crystals.


M. J. Buerger [1, p. 107]


The study of x-ray diffraction has confirmed the notion that the
symmetrical appearance of a crystal is a result of the symmetrical pattern formed
by its atoms or molecules. In other words, there is an infinite group of
symmetry operations transforming the pattern (regarded as extending
throughout the whole space) into itself. These operations may or may not include
rotations, reflections, glide reflections, rotatory reflections or screw
displacements, but in any case the translations contained in the symmetry group
form a nonempty normal subgroup. This translation subgroup determines
a lattice whose unit cell contains one or more atoms. The arrangement of
atoms in the first cell determines their arrangement in all the other cells
derived by the translations. If the cell contains only one atom, we naturally
choose the origin at the center of such an atom; then instead of an atom
inside each cell, we shall have one at every vertex, that is, at every lattice
point. But if the cell contains several atoms there will be several
superposed lattices of atoms. Thus a given crystal has a perfectly definite
translation group, and the lattice becomes definite as soon as we have chosen an
origin (at the center of an atom or elsewhere). There is still a theoretically
unlimited choice of unit cells, though in practice we tend to use basic
vectors of roughly equal lengths. However, the volume of the unit cell is
definite, since it depends on the number of lattice points in a crystal of a given
size. In fact, any three independent vectors generate a parallelepiped, and
this is a unit cell whenever it has a lattice point at each of its eight vertices
but none on its edges, nor on its faces, nor inside.

The affine theory shows that a sequence of "rational planes" 13.93 can
be chosen in infinitely many ways. In Euclidean geometry these are no
longer indistinguishable: each sequence has its interplanar spacing, which
can be measured as the distance from the origin to the "first" plane

18.31    $Xx + Yy + Zz = 1.$

Each sequence of planes contains all the lattice points. Hence, when we
compare one such sequence with another, the interplanar spacing is
proportional to the density of distribution of lattice points in one plane of the
sequence. This idea is physically important because the face planes and
cleavage planes of a crystal naturally tend to occur among the rational planes
of high density. Accordingly, we are chiefly interested in the sequences that
have a relatively large interplanar spacing. On the other hand, the most
interesting visible points are those at a relatively small distance from the
origin. The two lattices which we are going to consider are related in such
a way that the visible points of either are in directions perpendicular to the
rational planes of the other and at distances reciprocal to the interplanar
spacings [Buerger 1, p. 117]. Thus the more important planes of either
lattice will correspond to the more important points of the other.

The definition in terms of a basis is extremely simple [Coxeter 1, p. 181]
and the result is easily seen to be independent of the chosen basis. If a
given lattice consists of the points whose contravariant coordinates are
integers, the reciprocal lattice consists of the points whose covariant
coordinates are integers. In other words, the position vectors are

    $\mathbf{u} = \sum u^\alpha\mathbf{r}_\alpha$  and  $\mathbf{v} = \sum v_\alpha\mathbf{r}^\alpha,$

respectively, where $u^\alpha$ and $v_\alpha$ are integers and $\mathbf{r}^\alpha\cdot\mathbf{r}_\beta = \delta^\alpha_\beta$. The equation
$\mathbf{u}\cdot\mathbf{v} = 1$ or

    $u^1v_1 + u^2v_2 + u^3v_3 = 1$

(see 18.26) implies that the three u's are coprime, and likewise the three v's.
For each visible point $(u^1, u^2, u^3)$ of the given lattice, this equation may be
identified with 18.31: it represents a first rational plane of the reciprocal
lattice; perpendicular to $\mathbf{u}$ and at the reciprocal distance $|\mathbf{u}|^{-1}$. Since the
distinction between "covariant" and "contravariant" is made by an arbitrary
choice, the relation between the two lattices is symmetric: the first rational
planes of either are the polar planes (with respect to the unit sphere) of the
visible points of the other.

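The shapes of the unit cells of the two lattices are determined by the inner
products 18.22. For the edge lengths of these parallelepipeds, it is
convenient to use the abbreviations

18.32    $g_\alpha = \sqrt{g_{\alpha\alpha}} = |\mathbf{r}_\alpha|, \quad g^\alpha = \sqrt{g^{\alpha\alpha}} = |\mathbf{r}^\alpha|,$

so that the angles between pairs of adjacent edges are the angles whose
cosines are

    $\frac{g_{23}}{g_2g_3}, \quad \frac{g_{31}}{g_3g_1}, \quad \frac{g_{12}}{g_1g_2}, \quad \frac{g^{23}}{g^2g^3}, \quad \frac{g^{31}}{g^3g^1}, \quad \frac{g^{12}}{g^1g^2}.$

By 18.13 and 18.14, their volumes are $J$ and $J^{-1}$ where, by 18.29,

    $J = \sqrt{G}, \quad G = \det(g_{\alpha\beta}).$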


The simplest special case is the cubic lattice consisting of the points whose
rectangular Cartesian coordinates are integers. In this case

    $\mathbf{r}_1 = \mathbf{r}^1 = \mathbf{i}, \quad \mathbf{r}_2 = \mathbf{r}^2 = \mathbf{j}, \quad \mathbf{r}_3 = \mathbf{r}^3 = \mathbf{k},$

the distinction between covariant and contravariant disappears, and the
lattice is its own reciprocal. Other important lattices are obtained as
sublattices of the simple cubic lattice, that is, by putting suitable restrictions on
these integral Cartesian coordinates. By making the three coordinates of
each point have an even sum, we obtain the face-centered cubic lattice; and
by making them either all even or all odd, the body-centered cubic lattice.
(These names refer to a larger simple lattice whose points have only even
coordinates. In this larger lattice the center of a face has two odd
coordinates and the center of a cell or "body" has three odd coordinates.) Each
of these two lattices is similar to the reciprocal of the other; for, the bases

18.33    $\mathbf{r}_1 = (0, 1, 1), \quad \mathbf{r}_2 = (1, 0, 1), \quad \mathbf{r}_3 = (1, 1, 0),$

18.34    $\mathbf{r}^1 = (-1, 1, 1), \quad \mathbf{r}^2 = (1, -1, 1), \quad \mathbf{r}^3 = (1, 1, -1)$

evidently satisfy a trivially modified form of 18.11, namely,

    $\mathbf{r}^\alpha\cdot\mathbf{r}_\beta = 2\delta^\alpha_\beta.$

This means that they are reciprocal with respect to a sphere of radius $\sqrt{2}$.
To make them reciprocal with respect to a unit sphere, we merely have to
divide all the coordinates of one lattice by 2 (or all the coordinates of both
by $\sqrt{2}$).
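
The modified relation between the bases 18.33 and 18.34 is a matter of nine inner products, which the following sketch simply tabulates. The function names are illustrative; halving the second basis is the first of the two rescalings mentioned above, after which the unit cells should have reciprocal volumes.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def det3(rows):
    """Determinant of three row vectors, i.e. the volume J of the cell they span."""
    (a, b, c), (d, e, f), (g, h, i) = rows
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def pairing(upper, lower):
    """Matrix of inner products of every upper vector with every lower vector."""
    return [[dot(a, b) for b in lower] for a in upper]

def scaled(vectors, factor):
    return [tuple(factor * x for x in v) for v in vectors]

FACE_CENTERED = [(0, 1, 1), (1, 0, 1), (1, 1, 0)]       # basis 18.33
BODY_CENTERED = [(-1, 1, 1), (1, -1, 1), (1, 1, -1)]    # basis 18.34

print(pairing(BODY_CENTERED, FACE_CENTERED))                # twice the unit matrix
print(pairing(scaled(BODY_CENTERED, 0.5), FACE_CENTERED))   # the unit matrix
print(det3(FACE_CENTERED), det3(scaled(BODY_CENTERED, 0.5)))
```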

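For comparison with Buerger [1, pp. 117-127], it is perhaps worth while
to point out that his

    $\mathbf{a}, \mathbf{b}, \mathbf{c}, \ \mathbf{a}^*, \mathbf{b}^*, \mathbf{c}^*, \ a, b, c, \ a^*, b^*, c^*, \ d_{hkl}, \ \sigma_{hkl}, \ V, \ V^*$

are our

    $\mathbf{r}_1, \mathbf{r}_2, \mathbf{r}_3, \ \mathbf{r}^1, \mathbf{r}^2, \mathbf{r}^3, \ g_1, g_2, g_3, \ g^1, g^2, g^3, \ |\mathbf{v}|^{-1}, \ \mathbf{v}, \ J, \ J^{-1}.$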
EXERCISES 


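1. Consider two plane lattices derived from one another by a quarter-turn about
the origin. Exhibit them as "reciprocal lattices"

    $u^1\mathbf{r}_1 + u^2\mathbf{r}_2$  and  $v_1\mathbf{r}^1 + v_2\mathbf{r}^2.$

(Hint: $\mathbf{r}^1$ is perpendicular to $\mathbf{r}_2$, and $\mathbf{r}^2$ to $\mathbf{r}_1$; also $\mathbf{r}^1\cdot\mathbf{r}_1 = \mathbf{r}^2\cdot\mathbf{r}_2$.)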

2. Write out the fundamental tensors for the face-centered and body-centered cubic 
lattices with bases 18.33 and 18.34. Sketch the unit cells, which are rhombohedra. 
(The former may be regarded as a solid octahedron with a regular tetrahedron stuck 
on to each of two opposite faces.) 

3. A lattice, in three dimensions as in two (§4.1), has a Dirichlet region (or Vo- 
ronoi polyhedron) consisting of all the points that are as near to the origin as to any 
other lattice point. For the simple cubic lattice, this is a cube; for the face-centered 
lattice it is a rhombic dodecahedron, whose faces are twelve equal rhombi; and for 
the body-centered lattice it is a truncated octahedron, whose faces consist of six squares 
and eight regular hexagons [Steinhaus 2, pp. 152, 157].

18.4 THE CRITICAL LATTICE OF A SPHERE


Given a bounded region K ... which contains the origin O as an inner
point, we consider all those lattices which have no point except O in
the interior of K. The lower bound of their determinants is called the
critical determinant of K ... and the lattices for which this lower bound
is attained are called the critical lattices of K.


Harold Davenport (1907-1969)*


Every lattice has a certain minimum distance c between pairs of lattice
points, and a certain volume J for its unit cell (§ 13.9). This J is called the
determinant of the lattice, because it is the determinant of the Cartesian
components of the three basic vectors $r_\alpha$. We proceed to prove that, for a
given value of c, the minimum value of J occurs when the lattice is face-centered
cubic.†


Figure 18.4a 


Consider any point A of a given lattice whose unit cell has volume J.
Choose a lattice point B at the minimum distance c from A, and a lattice
point C outside the line AB, at the shortest distance b ($\ge c$) from A. These
points can always be chosen so that the angle A and sides a, b, c of the triangle
ABC satisfy


$A \le \tfrac{1}{2}\pi$,  $a \ge b \ge c$,


and therefore  $b^2 + c^2 \ge a^2$


(Figure 18.4a). Let $\Delta$ and R denote the area and circumradius of this triangle,
so that, by 1.53 and 1.55,


$16\Delta^2 = -a^4 - b^4 - c^4 + 2b^2c^2 + 2c^2a^2 + 2a^2b^2$,  $16\Delta^2R^2 = a^2b^2c^2$.
The plane ABC is, of course, a rational plane of the lattice. In one of the


two nearest parallel planes of the same system, there is a lattice point D 


* Recent progress in the geometry of numbers, Proceedings of the International Congress of 
Mathematicians, 1950, vol. I, p. 166. 

† A. P. Dempster, The minimum of a definite ternary quadratic form, Canadian Journal of
Mathematics, 9 (1957), pp. 232-234.

whose orthogonal projection $D_1$ on the plane ABC is not outside the parallelogram
ABA’C. Replacing the triangle ABC, if necessary, by the triangle
A’BC, we may assume that $D_1$ is not outside the triangle ABC. (The possible
change of notation involves the central inversion that interchanges B
and C.) Denoting by d the distance $DD_1$ from D to the plane ABC, and writing
$\Delta$ for the area of the triangle ABC as before, we have


$J = 2\Delta d$.


Since none of AD, BD, CD is parallel to AB, all of them are greater than or
equal to the next shortest distance b. Since the triangle ABC has no obtuse
angle, circles of radius R with centers at the vertices overlap in such a
way that every interior point of the triangle except the circumcenter is inside
at least one of these circles. Therefore the distance of $D_1$ from at least
one vertex is less than R, except that it is equal to R when $D_1$ is the circumcenter.
Thus at least one of AD, BD, CD has its square less than or equal
to $R^2 + d^2$, and consequently


$R^2 + d^2 \ge b^2$,


with equality only when $D_1$ is the circumcenter (and possibly not even then).
Hence
$J^2 = (2\Delta d)^2$


$\ge 4\Delta^2(b^2 - R^2) = \tfrac14 b^2(-a^4 - b^4 - c^4 + 2b^2c^2 + c^2a^2 + 2a^2b^2)$

$= \tfrac12 c^6 + \tfrac14 c^2(b^2 - c^2)(3b^2 + 2c^2) + \tfrac14 b^2(a^2 - b^2)(b^2 + c^2 - a^2)$

$\ge \tfrac12 c^6$,


with equality only when
$R^2 + d^2 = b^2$,  $b = c$,
and either
(i) $a = b$  or  (ii) $b^2 + c^2 = a^2$.


Thus there are apparently two “critical” lattices for which the ratio $J/c^3$ attains
its minimum value $\sqrt{1/2}$. However, we shall soon see that these two
are merely different aspects of one: the face-centered cubic lattice.

In case (i) (Figure 18.4b) the tetrahedron ABCD is regular, and we may
choose Cartesian coordinates


(0, 0, 0),  (0, 1, 1),  (1, 0, 1),  (1, 1, 0)


Figure 18.4b    Figure 18.4c

for its vertices, in agreement with the basis 18.33 for the face-centered cubic
lattice. In case (ii) (Figure 18.4c) ABA’C is a square, the base of a pyramid
whose sloping faces (such as ABD) are equilateral triangles. Choosing
A, B and D as before, we now have C at (0, 1, −1), yielding the alternative
basis


18.41    $r_1 = (0, 1, 1)$,  $r_3 - r_2 = (0, 1, -1)$,  $r_3 = (1, 1, 0)$


for the same lattice. Thus we have proved that the face-centered cubic lat- 
tice (whose points have integral Cartesian coordinates with an even sum) is 
really the only “critical” lattice. 
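
The inequality $J^2 \ge \tfrac12 c^6$ can be illustrated numerically. The sketch below is not part of the proof: it assumes integer bases, and it finds the minimum distance by a brute-force search over small integer coefficients, with a search bound that happens to suffice for the bases listed. All function names are hypothetical.

```python
from fractions import Fraction
from itertools import product


def det3(r1, r2, r3):
    """Determinant of the Cartesian components of three basic vectors."""
    (a, b, c), (d, e, f), (g, h, i) = r1, r2, r3
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def min_square_distance(basis, bound=3):
    """Smallest squared length c^2 among the nonzero lattice vectors
    sum u^a r_a with integer coefficients |u^a| <= bound."""
    best = None
    for coeffs in product(range(-bound, bound + 1), repeat=3):
        if coeffs == (0, 0, 0):
            continue
        vec = [sum(u * r[k] for u, r in zip(coeffs, basis)) for k in range(3)]
        d2 = sum(x * x for x in vec)
        if best is None or d2 < best:
            best = d2
    return best


def critical_ratio(basis):
    """Exact value of J^2 / c^6 for an integer basis."""
    J = det3(*basis)
    c2 = min_square_distance(basis)
    return Fraction(J * J, c2 ** 3)


SIMPLE = [(1, 0, 0), (0, 1, 0), (0, 0, 1)]      # simple cubic lattice
FCC = [(0, 1, 1), (1, 0, 1), (1, 1, 0)]         # basis 18.33
FCC_ALT = [(0, 1, 1), (0, 1, -1), (1, 1, 0)]    # basis 18.41
BCC = [(-1, 1, 1), (1, -1, 1), (1, 1, -1)]      # basis 18.34
```

Both bases of the face-centered lattice give the ratio 1/2, whereas the simple cubic and body-centered lattices give the larger values 1 and 16/27.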

By 18.25, the square of the length of the lattice vector


$u = \sum u^\alpha r_\alpha$
is  $\sum\sum g_{\alpha\beta}\, u^\alpha u^\beta$,
and $c^2$ is the minimum value of this positive definite ternary quadratic form
when the coordinates $u^1, u^2, u^3$ are restricted to integral values other than
0, 0, 0. Hence, among all such forms with a given minimum value $c^2$, the


minimum determinant $G = J^2 = \tfrac12 c^6$ occurs when the basic vectors are
given by 18.33, so that the form is


$(\sum u^\alpha r_\alpha)^2 = (u^2 + u^3,\ u^3 + u^1,\ u^1 + u^2)^2$
$= (u^2 + u^3)^2 + (u^3 + u^1)^2 + (u^1 + u^2)^2$.

In other words, every “extreme” form in three variables is equivalent to

$(u^1)^2 + (u^2)^2 + (u^3)^2 + u^2u^3 + u^3u^1 + u^1u^2$.
This is a famous result, first proved by Gauss [1, vol. 2, pp. 192-196].*
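
As an illustration only, the following sketch compares the form derived from the basis 18.33 with the extreme form on a cube of integer points. The form obtained from 18.33 is exactly twice the extreme form, the factor 2 being the value of $c^2$ for that basis. The names `squared_length`, `extreme_form` and `check`, and the size of the cube, are assumptions made for the example.

```python
from itertools import product

FCC = [(0, 1, 1), (1, 0, 1), (1, 1, 0)]  # basis 18.33


def squared_length(basis, u):
    """(sum u^a r_a)^2 for integer coordinates u = (u1, u2, u3)."""
    vec = [sum(ui * r[k] for ui, r in zip(u, basis)) for k in range(3)]
    return sum(x * x for x in vec)


def extreme_form(u):
    """(u1)^2 + (u2)^2 + (u3)^2 + u2*u3 + u3*u1 + u1*u2."""
    u1, u2, u3 = u
    return u1 * u1 + u2 * u2 + u3 * u3 + u2 * u3 + u3 * u1 + u1 * u2


def check(bound=4):
    """Compare the two forms on a cube of integer points and return the
    minimum of the extreme form over the nonzero points examined."""
    smallest = None
    for u in product(range(-bound, bound + 1), repeat=3):
        assert squared_length(FCC, u) == 2 * extreme_form(u)
        if u != (0, 0, 0):
            value = extreme_form(u)
            smallest = value if smallest is None else min(smallest, value)
    return smallest
```

On the points examined, the extreme form never falls below 1, in agreement with its being positive definite and integer-valued.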

EXERCISE 
Using the basis 18.41 instead of 18.33, obtain the equivalent form
$(u^1)^2 + (u^2)^2 + (u^3)^2 + u^1u^3 + u^2u^3$.


18.5 GENERAL COORDINATES 


The position of a point in Euclidean space may be specified by three numbers
in many ways. Rectangular Cartesian coordinates (x, y, z) are the most
familiar; but we have seen (e.g., in § 17.7) that other systems, such as cylindrical
coordinates, are sometimes more convenient. Let us use the notation
$(u^1, u^2, u^3)$ for general coordinates. The essential requirements are that,
within a suitable range of variation, x, y, z are single-valued differentiable
functions of $u^1, u^2, u^3$, while $u^1, u^2, u^3$ are equally well-behaved functions
of x, y, z. For instance, if $(u^1, u^2, u^3)$ are cylindrical coordinates, we have


* For an account of the history of extreme forms up to 1951, see Coxeter, Canadian Journal 
of Mathematics, 3 (1951), p. 393.

$x = u^1 \cos u^2$,  $y = u^1 \sin u^2$,  $z = u^3$;
$u^1 = \sqrt{x^2 + y^2}$,  $u^2 = \arctan\frac{y}{x}$,  $u^3 = z$.

For arbitrary constants a, b, c, the surfaces
18.51    $u^1 = a$,  $u^2 = b$,  $u^3 = c$
are called level surfaces, and their three curves of intersection such as

$u^2 = b$,  $u^3 = c$

are called level curves. Through a given point there is usually one level surface
of each kind, and one level curve of each kind, but exceptions are allowed.
For instance, in the case of cylindrical coordinates the level surfaces
are cylinders $u^1 = a$ or $x^2 + y^2 = a^2$, planes $u^2 = b$ or $y = x \tan b$ through
the z-axis, and planes $z = c$ orthogonal to the z-axis. The z-axis itself is
exceptional because each of its points lies on infinitely many planes $u^2 = b$
(in fact, on all of them).


According to the ordinary meaning of partial differentiation, the partial 
derivatives of the position vector 


$r = xi + yj + zk$
are the unit vectors along the Cartesian axes:


$\frac{\partial r}{\partial x} = i$,  $\frac{\partial r}{\partial y} = j$,  $\frac{\partial r}{\partial z} = k$.


The differential of r, representing displacement in any given direction, is
$dr = i\,dx + j\,dy + k\,dz = (dx, dy, dz)$,

and the element of arc of any curve in this direction is ds, where

18.52    $(ds)^2 = |dr|^2 = dr \cdot dr = (dx)^2 + (dy)^2 + (dz)^2$.


Instead of regarding r as a function of x, y, z, we may regard it as a function
of $u^1, u^2, u^3$. Using a subscript $\alpha$ to indicate a partial derivative with respect
to $u^\alpha$ (so that $x_\alpha = \partial x/\partial u^\alpha$, etc.), we have


18.53    $r_\alpha = \frac{\partial r}{\partial u^\alpha} = x_\alpha i + y_\alpha j + z_\alpha k$
and  $dr = r_1\,du^1 + r_2\,du^2 + r_3\,du^3 = \sum r_\alpha\,du^\alpha$.


For a displacement along the level curve $u^2 = b$, $u^3 = c$ we have $du^2 = 0$,
$du^3 = 0$, and $dr = r_1\,du^1$. Thus $r_1$ is a tangent vector to this level curve.
Similarly $r_2$ is a tangent vector to the curve $u^3 = c$, $u^1 = a$, and $r_3$ is a tangent
vector to $u^1 = a$, $u^2 = b$. At a general point in space we thus have a
definite trihedron $r_1 r_2 r_3$ depending on the coordinate system. These basic
vectors are not necessarily of unit length, and not necessarily orthogonal
(though they do happen to be orthogonal in the case of cylindrical coordinates).
The derivatives of the Cartesian coordinates are expressible in terms
of them:


18.54    $x_\alpha = r_\alpha \cdot i$,  $y_\alpha = r_\alpha \cdot j$,  $z_\alpha = r_\alpha \cdot k$.


At a general point in space, any two of the three basic vectors determine a
tangent plane to one of the three level surfaces through the point. For instance,
the plane $r_2 r_3$ touches the surface $u^1 = a$ since it contains tangents
to two curves lying in that surface. Hence the dual basic vectors 18.12,
orthogonal to the tangent planes $r_2 r_3$, $r_3 r_1$, $r_1 r_2$, are normals to the level surfaces
18.51: not, in general, of unit length, but adjusted so that


$r^\alpha \cdot r_\alpha = 1$  ($\alpha = 1, 2, 3$).


The notation 18.22 provides the following formula for the element of arc
ds in the direction of a given displacement dr:


$(ds)^2 = dr \cdot dr = \sum r_\alpha\,du^\alpha \cdot \sum r_\beta\,du^\beta$
18.55    $= \sum\sum g_{\alpha\beta}\,du^\alpha\,du^\beta$
$= g_{11}(du^1)^2 + g_{22}(du^2)^2 + g_{33}(du^3)^2$
$+ 2g_{23}\,du^2\,du^3 + 2g_{31}\,du^3\,du^1 + 2g_{12}\,du^1\,du^2$.


In the special case when $u^1, u^2, u^3$ are x, y, z, this reduces to 18.52. In general,
the coefficients $g_{\alpha\beta}$ are not constants but functions of the coordinates
and their derivatives (see 18.27).

To deal with any given system of coordinates, we work out $g_{\alpha\beta} = r_\alpha \cdot r_\beta$
from the derivatives 18.53, then obtain $g^{\alpha\beta}$ by taking the cofactor of $g_{\alpha\beta}$ in
the determinant G and dividing by G itself.
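
The recipe just described can be sketched numerically. The code below is an illustration under stated assumptions, not a method taken from the text: the derivatives 18.53 are estimated by central differences with an arbitrarily chosen step `h`, and all function names are hypothetical. Cylindrical coordinates serve as the test case.

```python
import math


def cylindrical(u):
    """Position (x, y, z) for cylindrical coordinates u = (u1, u2, u3)."""
    u1, u2, u3 = u
    return (u1 * math.cos(u2), u1 * math.sin(u2), u3)


def basic_vectors(position, u, h=1e-6):
    """Central-difference estimates of r_alpha = dr/du^alpha (18.53)."""
    vectors = []
    for a in range(3):
        up, down = list(u), list(u)
        up[a] += h
        down[a] -= h
        p, q = position(up), position(down)
        vectors.append(tuple((pi - qi) / (2 * h) for pi, qi in zip(p, q)))
    return vectors


def metric(position, u):
    """g_ab = r_a . r_b at the point with coordinates u."""
    r = basic_vectors(position, u)
    return [[sum(x * y for x, y in zip(ra, rb)) for rb in r] for ra in r]


def det3(m):
    """Determinant of a 3 x 3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def inverse_by_cofactors(g):
    """g^ab = (cofactor of g_ab in G) / G.  Since g is symmetric, the
    matrix of cofactors needs no transposition."""
    G = det3(g)
    inv = [[0.0] * 3 for _ in range(3)]
    for a in range(3):
        for b in range(3):
            rows = [i for i in range(3) if i != a]
            cols = [j for j in range(3) if j != b]
            minor = (g[rows[0]][cols[0]] * g[rows[1]][cols[1]]
                     - g[rows[0]][cols[1]] * g[rows[1]][cols[0]])
            inv[a][b] = (-1) ** (a + b) * minor / G
    return inv
```

For instance, `metric(cylindrical, (2.0, 0.7, -1.0))` is, up to rounding, the diagonal matrix with entries 1, 4, 1, which may be compared with the result of working Exercise 4 below by hand.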

Our use of the letter J in 18.13 and 18.28 commemorates the German 
mathematician C. G. J. Jacobi (1804-1851). In fact, for transforming the 
triple integral of a function 


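$f(x, y, z) = F(u^1, u^2, u^3)$


from Cartesian to other coordinates, we use the formula


$\iiint f(x, y, z)\,dx\,dy\,dz = \iiint F(u^1, u^2, u^3)\,\frac{\partial(x, y, z)}{\partial(u^1, u^2, u^3)}\,du^1\,du^2\,du^3$,


which involves the Jacobian


$\frac{\partial(x, y, z)}{\partial(u^1, u^2, u^3)} = \begin{vmatrix} x_1 & y_1 & z_1 \\ x_2 & y_2 & z_2 \\ x_3 & y_3 & z_3 \end{vmatrix} = [r_1\, r_2\, r_3] = J$.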
EXERCISES 


1. If $u^1, u^2, u^3$ are affine coordinates, they are the components of r with reference
to three fixed independent vectors $r_1, r_2, r_3$, so that


18.56    $r = \sum u^\alpha r_\alpha$.


(This notation is appropriate since it makes $r_\alpha = \partial r/\partial u^\alpha$.) The components of r with
reference to i, j, k are


$x = \sum x_\alpha u^\alpha$,  $y = \sum y_\alpha u^\alpha$,  $z = \sum z_\alpha u^\alpha$;
$r_\alpha = x_\alpha i + y_\alpha j + z_\alpha k$ is a constant vector for each $\alpha$, and all the $g_{\alpha\beta}$ are constants.


2. Oblique Cartesian coordinates are affine coordinates with the same unit of measurement
along all three axes, so that $|r_\alpha| = 1$. In this case $g_{\alpha\alpha} = 1$, and $g_{\alpha\beta}$ is the cosine
of the angle between $r_\alpha$ and $r_\beta$ (which are the axes of the coordinates $u^\alpha$ and $u^\beta$).


3. Rectangular Cartesian coordinates (with axes rotated to new positions without
changing the origin) arise when $r_1 r_2 r_3$ is an orthogonal trihedron of unit vectors, like
ijk, so that


18.57    $g_{\alpha\beta} = \delta_{\alpha\beta}$


(meaning 1 or 0 according as $\alpha$ and $\beta$ are equal or unequal). By 18.54, $x_\alpha$ is the cosine
of the angle between the new axis $r_\alpha$ and the old axis i; similarly for $y_\alpha$ and $z_\alpha$. Interchanging
the roles of the new and old axes in the relation


$r_\alpha = x_\alpha i + y_\alpha j + z_\alpha k$,
we obtain
18.58    $i = \sum x_\alpha r_\alpha$,  $j = \sum y_\alpha r_\alpha$,  $k = \sum z_\alpha r_\alpha$.


(Since now $r^\alpha = r_\alpha$, the distinction between covariant and contravariant disappears.)
From 18.56 deduce


$u^\alpha = r \cdot r^\alpha = x_\alpha x + y_\alpha y + z_\alpha z$
so that (paradoxically)


$\frac{\partial u^\alpha}{\partial x} = \frac{\partial x}{\partial u^\alpha}$.


From 18.27 and 18.57 deduce
$x_\alpha x_\beta + y_\alpha y_\beta + z_\alpha z_\beta = \delta_{\alpha\beta}$.


With the help of 18.58, evaluate $\sum x_\alpha^2$ and $\sum y_\alpha z_\alpha$. (In technical language, these properties
make


$\begin{pmatrix} x_1 & y_1 & z_1 \\ x_2 & y_2 & z_2 \\ x_3 & y_3 & z_3 \end{pmatrix}$


an “orthogonal matrix.”)
4. Find $g_{\alpha\beta}$ in the case of cylindrical coordinates. Verify that $G = J^2$.
5. Find $g_{\alpha\beta}$ in the case of spherical polar coordinates, defined by


$x = u^3 \sin u^1 \cos u^2$,  $y = u^3 \sin u^1 \sin u^2$,  $z = u^3 \cos u^1$.
Describe the level surfaces.
6. Find $g_{\alpha\beta}$ in the case of confocal coordinates, defined by
$x^2 = \frac{(A - u^1)(A - u^2)(A - u^3)}{(A - B)(A - C)}$,
$y^2 = \frac{(B - u^1)(B - u^2)(B - u^3)}{(B - C)(B - A)}$,
$z^2 = \frac{(C - u^1)(C - u^2)(C - u^3)}{(C - A)(C - B)}$,

where $u^1 < C < u^2 < B < u^3 < A$ (and $x^2$ means “x squared”). In this case the level
surfaces are the central quadrics
18.59    $\frac{x^2}{A - \lambda} + \frac{y^2}{B - \lambda} + \frac{z^2}{C - \lambda} = 1$,

which are ellipsoids $u^1 = \lambda$ if $\lambda < C$, one-sheet hyperboloids $u^2 = \lambda$ if $C < \lambda < B$, and
two-sheet hyperboloids $u^3 = \lambda$ if $B < \lambda < A$. In fact, $u^1, u^2, u^3$ are the roots of 18.59,
regarded as a cubic equation for $\lambda$ [Weatherburn 2, p. 211].

18.6 THE ALTERNATING SYMBOL 


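As a kind of counterpart of the Kronecker delta, we shall find it convenient
to use the “alternating epsilon”


$\varepsilon^{\alpha\beta\gamma} = \varepsilon_{\alpha\beta\gamma} = \tfrac12(\beta - \gamma)(\gamma - \alpha)(\alpha - \beta)$,


which is 1 if $\alpha\beta\gamma$ is an even permutation of 123, −1 if it is an odd permutation,
and 0 otherwise. This trick provides one of the best ways to introduce
the theory of determinants [Jeffreys 1, p. 13]:


18.61    $\begin{vmatrix} x_1 & y_1 & z_1 \\ x_2 & y_2 & z_2 \\ x_3 & y_3 & z_3 \end{vmatrix} = \sum\sum\sum \varepsilon^{\alpha\beta\gamma}\, x_\alpha y_\beta z_\gamma$,


$\begin{vmatrix} g_{11} & g_{12} & g_{13} \\ g_{21} & g_{22} & g_{23} \\ g_{31} & g_{32} & g_{33} \end{vmatrix} = \sum\sum\sum \varepsilon^{\alpha\beta\gamma}\, g_{1\alpha}\, g_{2\beta}\, g_{3\gamma}$.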
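
The closed expression for the alternating symbol translates directly into code. The following minimal sketch (the names `eps` and `det_by_epsilon` are illustrative) evaluates a three-rowed determinant by the triple sum in the second formula above, taking one element from each row.

```python
def eps(a, b, c):
    """Alternating symbol for indices a, b, c in {1, 2, 3}.
    The product (b - c)(c - a)(a - b) is always 2, -2 or 0."""
    return (b - c) * (c - a) * (a - b) // 2


def det_by_epsilon(m):
    """Triple sum of eps(a, b, c) * m[1][a] * m[2][b] * m[3][c],
    with 1-based indices; m is a 3 x 3 matrix given as nested lists."""
    idx = (1, 2, 3)
    return sum(eps(a, b, c) * m[0][a - 1] * m[1][b - 1] * m[2][c - 1]
               for a in idx for b in idx for c in idx)
```

Of the 27 terms in the sum, only the six with distinct indices survive, which is the usual expansion of a determinant.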


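From 18.12-18.15 we deduce


$[r_\alpha\, r_\beta\, r_\gamma] = \varepsilon_{\alpha\beta\gamma}\, J$,  $[r^\alpha\, r^\beta\, r^\gamma] = \varepsilon^{\alpha\beta\gamma}\, J^{-1}$,
$J^{-1}\, r_\beta \times r_\gamma = \sum \varepsilon_{\alpha\beta\gamma}\, r^\alpha$,  $J\, r^\beta \times r^\gamma = \sum \varepsilon^{\alpha\beta\gamma}\, r_\alpha$.


Since $\sum\sum \varepsilon^{\alpha\beta\gamma} g_{\alpha\beta} = 0$, it follows that
$\sum r^\alpha \times r_\alpha = \sum r^\alpha \times \sum g_{\alpha\beta}\, r^\beta = \sum\sum\sum \varepsilon^{\alpha\beta\gamma} g_{\alpha\beta}\, r_\gamma / J$


$= 0$.
The analogous “two-dimensional” symbol is
$\varepsilon^{ij} = \varepsilon_{ij} = j - i$  (i = 1 or 2, j = 1 or 2)
which enables us to write
$\begin{vmatrix} g_{11} & g_{12} \\ g_{21} & g_{22} \end{vmatrix} = \sum\sum \varepsilon^{ij}\, g_{1i}\, g_{2j}$.


(We use Latin or Greek indices according as the range of values is 12 or 123.)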
EXERCISES 
1. Use 18.61 to obtain a formula for the cofactor of $x_\alpha$. Work this out for the case
when $\alpha = 3$.
2. If $c^{ij} = c^{ji}$, $\sum\sum \varepsilon_{ij}\, c^{ij} = 0$. Use the same idea to justify the step $\sum\sum \varepsilon^{\alpha\beta\gamma} g_{\alpha\beta} = 0$
in the above evaluation of $\sum r^\alpha \times r_\alpha$.

Differential geometry of surfaces


The present chapter extends the notion of curvature from curves to sur- 
faces. This extension is achieved by considering plane sections of a given 
surface, especially normal sections. Through the normal at a given point 
we can draw infinitely many planes; in fact, we can imagine such a plane 
to rotate continuously about the normal. In general, the curvature of the 
section varies continuously. For one of the planes the curvature attains its 
maximum value, for another, its minimum. We shall see that these two 
planes are at right angles, and that the product of the two “principal curva- 
tures” determines the essential nature of the surface. For instance, this 
“Gaussian curvature” is positive for an oval surface such as an ellipsoid, 
zero for a developable surface such as a cylinder or cone, and negative for 
a saddle-shaped surface such as a hyperbolic paraboloid. 


19.1 THE USE OF TWO PARAMETERS ON A SURFACE 


To fix the position of a point on the earth’s surface, we may give its
latitude and longitude. ... Through points on the equator draw meridians;
through points on the Greenwich meridian, draw parallels of
latitude. The position of a point ... is then given by the two curves,
one of each family, which go through it. ... Each point, except the
poles, acquires two definite coordinates. We can generalize this
method to any surface, or rather to a piece of any surface; we take
two families of curves on the surface, such that through each point
goes just one curve of each family ... as if a fine fishing-net were
thrown over the surface.


H. G. Forder [3, p. 133] 


A surface f(x, y, z) = 0 is often conveniently represented by a set of three
parametric equations


$x = x(u^1, u^2)$,  $y = y(u^1, u^2)$,  $z = z(u^1, u^2)$,


from which the single equation f = 0 could be derived by eliminating the
parameters $u^1$, $u^2$. We shall assume, as before, that the functions involved
are continuous and possess all the continuous derivatives that we need.

A simple instance arises when we regard x and y as the parameters, so 
that the three equations become 


$x = u^1$,  $y = u^2$,  $z = F(u^1, u^2)$,


where the expression for z is the result of solving the equation f(x, y, z) = 0
for z in terms of x and y. Such an equation


$z = F(x, y)$


is called Monge’s form of the equation for a surface. For instance, the sphere
$x^2 + y^2 + z^2 = 1$ becomes


$z = \pm\sqrt{1 - x^2 - y^2}$.


The square root makes this a clumsy way to investigate the sphere. It is
far better to take $u^1$ and $u^2$ to be colatitude and longitude, so that


19.11    $x = \sin u^1 \cos u^2$,  $y = \sin u^1 \sin u^2$,  $z = \cos u^1$.
(Colatitude $u^1$ means latitude $\tfrac12\pi - u^1$.)

The vector equation for a surface is
19.12    $r = r(u^1, u^2)$,


just as the vector equation for a curve is $r = r(u)$. The essential difference
is that the curve has only one parameter, whereas the surface has two independent
parameters.

One way to explore a surface is to investigate families of curves lying on
it. Among these are the parametric curves


$u^1 = a$  and  $u^2 = b$,


where a and b are arbitrary constants. Through a given point there is usually
one parametric curve of each kind, but exceptions are allowed. For
instance, when colatitude and longitude are used on the unit sphere, the
curves $u^1 = a$ are circles called parallels of latitude and the curves $u^2 = b$
are great circles called meridians. Almost every point on the sphere lies on
one “parallel” and one meridian, but the north and south poles lie on all
the meridians.

The position vector r, of a point on the surface, is a vector function of
$u^1$ and $u^2$. Using a subscript i to indicate a partial derivative with respect
to $u^i$, we have


$dr = r_1\,du^1 + r_2\,du^2 = \sum r_i\,du^i$,


where  $r_i = \frac{\partial r}{\partial u^i}$.


The differential dr may be regarded as a displacement along a given curve
on the surface or, more precisely, a displacement along a tangent. In the
case of the parametric curve $u^2 = b$, we have $du^2 = 0$, so that $dr = r_1\,du^1$.
Thus $r_1$ is a tangent vector to this curve, and similarly $r_2$ is a tangent vector
to the other parametric curve $u^1 = a$. It follows that the plane $r_1 r_2$,
spanned by these two covariant basic vectors, is the tangent plane to the surface
at the point considered, which is the point $(u^1, u^2) = (a, b)$. In the
same plane we define the two contravariant basic vectors $r^i$ to be normal
to the parametric curves, adjusted so that
$r^i \cdot r_j = \delta^i_j$.


In the tangent plane, there are tangent vectors going out from the point 

of contact in all directions. Any such vector 

19.13    $t = \sum a^i r_i = \sum a_i r^i$

is said to have covariant components $a_i$ and contravariant components $a^i$.

It is easily verified (cf. § 18.2) that

19.14    $a_i = t \cdot r_i$,  $a^i = t \cdot r^i$.

In particular, the basic vectors themselves have components $g_{ij}$, $g^{ij}$, such that
$r_i = \sum g_{ij}\, r^j$,  $r^i = \sum g^{ij}\, r_j$,

19.15
$g_{ij} = r_i \cdot r_j$,  $g^{ij} = r^i \cdot r^j$.


In terms of the fundamental magnitudes of the first order


$g_{11}$,  $g_{12} = g_{21}$,  $g_{22}$,


defined by $g_{ij} = r_i \cdot r_j$, we have the following formula for the element of arc
ds (of any curve on the surface):
$ds^2 = dr \cdot dr = \sum r_i\,du^i \cdot \sum r_j\,du^j$
19.16    $= \sum\sum g_{ij}\,du^i\,du^j$
$= g_{11}(du^1)^2 + 2g_{12}\,du^1\,du^2 + g_{22}(du^2)^2$.


The fundamental magnitudes (which are functions of the parameters $u^i$)
are spoken of collectively as a covariant tensor. The corresponding contravariant
tensor $g^{ij}$ is given by the last part of 19.15, which implies


$\sum g_{ik}\, g^{ij} = \delta_k^j$.
For each value of j, this is a pair of equations to be solved for the two unknowns
$g^{ij}$ (i = 1, 2). The solution is
$g^{ij}$ = (cofactor of $g_{ij}$ in g)/g,


where  $g = \begin{vmatrix} g_{11} & g_{12} \\ g_{21} & g_{22} \end{vmatrix} = g_{11}g_{22} - (g_{12})^2$.


Since the number of rows (or columns) in this determinant is only 2, the
cofactors are single elements, and we have the explicit expressions
19.17    $g^{11} = \frac{g_{22}}{g}$,  $g^{12} = g^{21} = -\frac{g_{12}}{g}$,  $g^{22} = \frac{g_{11}}{g}$.
It is now easy to derive the covariant components of the tangent vector t
from its contravariant components, or vice versa:
$a_j = t \cdot r_j = \sum a^i\, r_i \cdot r_j = \sum g_{ij}\, a^i$,
$a^j = t \cdot r^j = \sum a_i\, r^i \cdot r^j = \sum g^{ij}\, a_i$.
Of course, we are free to interchange i and j, obtaining
19.18    $a_i = \sum g_{ij}\, a^j$,  $a^i = \sum g^{ij}\, a_j$.
Since $g_{12} = r_1 \cdot r_2$, where $r_1$ and $r_2$ are tangents to the parametric curves,
the condition for the two families of parametric curves to intersect at right


angles is
$g_{12} = 0$.


Thus in the case of orthogonal parametric curves we have simply
$g = g_{11}g_{22}$,  $g^{12} = 0$,  $g^{ii} = 1/g_{ii}$,
whence by 19.15,
$r^i = g^{ii}\, r_i = r_i/g_{ii}$.
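
The passage between covariant and contravariant components can be sketched in a few lines. In this illustration the numerical values of $g_{ij}$ are hypothetical sample data, not taken from any particular surface, and the function names are assumptions; exact rational arithmetic is used so that raising an index after lowering it returns the original components exactly.

```python
from fractions import Fraction


def inverse_metric(g):
    """Contravariant tensor g^ij from 19.17 (g is a symmetric 2 x 2 matrix)."""
    (g11, g12), (_, g22) = g
    det = Fraction(g11 * g22 - g12 * g12)
    return [[g22 / det, -g12 / det], [-g12 / det, g11 / det]]


def lower(g, a_up):
    """a_i = sum_j g_ij a^j  (19.18)."""
    return [sum(g[i][j] * a_up[j] for j in range(2)) for i in range(2)]


def raise_index(g_inv, a_down):
    """a^i = sum_j g^ij a_j  (19.18)."""
    return [sum(g_inv[i][j] * a_down[j] for j in range(2)) for i in range(2)]


# Hypothetical sample values: g11 = 2, g12 = g21 = 1, g22 = 3.
G_SAMPLE = [[2, 1], [1, 3]]
```

With these sample values, the contravariant components (1, 2) correspond to the covariant components (4, 7), and applying `raise_index` to the latter recovers (1, 2).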
EXERCISES 


1. The basic vector $r_i$ has covariant components $g_{ij}$, contravariant components $\delta_i^j$.
2. $\sum r^i \times r_i = 0$. Interpret this geometrically in terms of areas of triangles.
3. Find $r^1$ and $r^2$ for the general surface of revolution
$r = (u^1 \cos u^2,\ u^1 \sin u^2,\ z)$,
where z is a function of $u^1$ alone.
4. Find $g_{ij}$ and $g^{ij}$ for the unit sphere expressed in terms of colatitude and longitude.


19.2 DIRECTIONS ON A SURFACE 


Just as a curve in the (x, y)-plane is given by an equation connecting x and
y, a curve on the surface 19.12 is given by an equation connecting $u^1$ and $u^2$.
A differential equation determines a family of curves. In general, a first-order,
first-degree differential equation


$\sum c_i\,du^i = 0$


determines a one-parameter family of curves: one curve through each point
of general position on the surface, going out from that point in a direction
determined by $du^2/du^1 = -c_1/c_2$; for instance, the equation


$du^2 = 0$
determines the “first” family of parametric curves. On the other hand, a
first-order, second-degree equation
19.21    $\sum\sum c_{ij}\,du^i\,du^j = 0$,


where $c_{12} = c_{21}$ and $c_{11}c_{22} < c_{12}^2$, determines a net of curves: two curves
through a general point on the surface; for example, the quadratic equation


$du^1\,du^2 = 0$


determines a net consisting of the two families of parametric curves taken
together.
We have seen that the vectors $r_i$ are in the directions of the tangents to
the parametric curves. Since
$r_i^2 = r_i \cdot r_i = g_{ii}$,


their lengths are $\sqrt{g_{ii}}$. Because of its frequent occurrence, we shall use the
abbreviation $g_i$ for this square root; thus


$g_i = \sqrt{g_{ii}} = |r_i|$,
and analogously
$g^i = \sqrt{g^{ii}} = |r^i|$.


In this notation, the unit tangent vectors to the parametric curves (touching
$du^2 = 0$ and $du^1 = 0$, respectively) are


$r_1/g_1$  and  $r_2/g_2$.


The angle $\phi$ at which the two parametric curves intersect is given by


19.22    $\cos\phi = \frac{r_1 \cdot r_2}{g_1 g_2} = \frac{g_{12}}{g_1 g_2}$.
We see from 19.17 that $g^1 = g_2/\sqrt{g}$, $g^2 = g_1/\sqrt{g}$; therefore
$\sin\phi = \frac{\sqrt{g}}{g_1 g_2} = \frac{1}{g_1 g^1} = \frac{1}{g_2 g^2}$,
that is,  $g_1 g^1 = g_2 g^2 = \csc\phi$.

It follows from the definition of an outer product that the length of the
vector $r_1 \times r_2$ is


19.23    $|r_1 \times r_2| = g_1 g_2 \sin\phi = \sqrt{g}$,


and that the element of area on the surface (naturally defined as the element
of area in the tangent plane) is


19.24    $dS = |r_1\,du^1 \times r_2\,du^2| = \sqrt{g}\,du^1\,du^2$
[Kreyszig 1, pp. 111-117]. The equation 19.23, in the form
$g = (r_1 \times r_2)^2$

A displacement along any curve on the surface is given by


$dr = \sum r_i\,du^i$.
If s is the arc of this curve, the unit tangent vector is
19.25    $t = \frac{dr}{ds} = \sum a^i r_i = a^1 r_1 + a^2 r_2$,
where
19.251    $a^i = \frac{du^i}{ds}$  (i = 1, 2).


Thus the arc derivatives of the parameters are the contravariant components
of t. We shall not attempt to find a corresponding interpretation for
the covariant components $a_i$, given by 19.14 or 19.18. On the other hand,
it is easy to give a geometrical interpretation for both kinds of component.
Let $PQ_1$ and $PQ_2$ be tangent vectors to the parametric curves, of such lengths
that PT, representing t, is a diagonal of the parallelogram $PQ_1TQ_2$, as in
Figure 19.2a. Let t divide the angle $\phi = \angle Q_1PQ_2$ into the two parts $\theta$ and
$\phi - \theta$. Let $PR_1TR_2$ be a parallelogram whose sides are perpendicular to
the tangents. Since


$\vec{PT} = t = a^1 r_1 + a^2 r_2 = \vec{PQ_1} + \vec{PQ_2}$

$= a_1 r^1 + a_2 r^2 = \vec{PR_1} + \vec{PR_2}$,


the lengths of the various lines are:


$PQ_i = g_i a^i$,  $PR_i = g^i a_i$,  $PS_i = t \cdot r_i/g_i = a_i/g_i$.


Figure 19.2a

The angles are given by
$\cos\theta = PS_1$,  $\cos(\phi - \theta) = PS_2$.


By taking the inner product of 19.13 with itself in various ways, we can
express the obvious relation $t^2 = 1$ in the equivalent forms


19.26    $\sum\sum g^{ij} a_i a_j = \sum a_i a^i = \sum\sum g_{ij}\, a^i a^j = 1$.


In virtue of 19.251, the last of these relations is a restatement of 19.16.
Similarly, by working out the inner product of two such unit tangent vectors


$\sum a_i r^i = \sum a^i r_i$  and  $\sum b_i r^i = \sum b^i r_i$,
we obtain various expressions for the cosine of the angle between them:
19.27    $\sum\sum g^{ij} a_i b_j = \sum a_i b^i = \sum a^i b_i = \sum\sum g_{ij}\, a^i b^j$.


Eliminating ds from the two equations 19.251, we obtain the differential
equation
$a^2\,du^1 - a^1\,du^2 = 0$


for a family of curves whose typical tangent is given by 19.25. Another
family, cutting all the members of the first family at right angles, has the
differential equation


19.28    $b^2\,du^1 - b^1\,du^2 = 0$,
where, by 19.27,
$\sum\sum g_{ij}\, a^i b^j = 0$.
Writing this relation in the form $\sum a_j b^j = 0$ or
$a_1 b^1 + a_2 b^2 = 0$,
where $a_j = \sum g_{ij}\, a^i$, we find that the equation 19.28 is equivalent to
$a_1\,du^1 + a_2\,du^2 = 0$.


In other words,
The orthogonal trajectories of the curves $a^2\,du^1 - a^1\,du^2 = 0$ are the curves


$\sum a_i\,du^i = 0$.
It follows also that the two families of curves
$\sum a_i\,du^i = 0$,  $\sum b_i\,du^i = 0$
are orthogonal if and only if
19.29    $\sum\sum g^{ij} a_i b_j = 0$.


The net of curves given by the quadratic differential equation 19.21 is an
orthogonal net if and only if


19.291    $\sum\sum g^{ij} c_{ij} = 0$.
For, this condition is the same as 19.29 if the quadratic expression factorizes
in the form


$\sum\sum c_{ij}\,du^i\,du^j = \sum a_i\,du^i \cdot \sum b_j\,du^j$,


so that  $c_{ij} + c_{ji} = a_i b_j + a_j b_i$.
EXERCISES 
1. Use 17.13 to prove that $(r_1 \times r_2)^2 = g$ (cf. 19.23).
2. Express $\tan\phi$ in terms of the fundamental magnitudes $g_{ij}$ and their determinant g.
3. In Figure 19.2a, $TS_1 = a^2/g^2$ and $TS_2 = a^1/g^1$.
4. Reconcile the formulas


$\cos\theta = a_1/g_1$,  $\cos(\phi - \theta) = a_2/g_2$,
$\sin\theta = a^2/g^2$,  $\sin(\phi - \theta) = a^1/g^1$
with 19.22.


5. The net of curves bisecting the angles between the parametric curves ($du^1\,du^2 = 0$)
is given by the differential equation


$g_{11}(du^1)^2 - g_{22}(du^2)^2 = 0$.
(Hint: Find the condition for the parallelogram $PQ_1TQ_2$ to be a rhombus.)

6. Interpret the equation 19.21 in the case when $c_{11} = c_{22} = 0$. What does the
condition 19.291 tell us in this case? What does it tell us about the curves described in
the preceding exercise?

7. Use 19.24 to prove that the area of the unit sphere 19.11 is $4\pi$.
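
A numerical check is no substitute for the proof requested in Exercise 7, but it shows how the element of area 19.24 is used in practice. The sketch below rests on two assumptions that are not part of the text: the tangent vectors are estimated by central differences, and the double integral is approximated by the midpoint rule on an n by n grid. The function names are hypothetical.

```python
import math


def sphere(u1, u2):
    """Unit sphere 19.11 (u1 = colatitude, u2 = longitude)."""
    return (math.sin(u1) * math.cos(u2),
            math.sin(u1) * math.sin(u2),
            math.cos(u1))


def sqrt_g(surface, u1, u2, h=1e-5):
    """sqrt(g11*g22 - g12^2), with r_1 and r_2 estimated by central differences."""
    r1 = [(p - q) / (2 * h)
          for p, q in zip(surface(u1 + h, u2), surface(u1 - h, u2))]
    r2 = [(p - q) / (2 * h)
          for p, q in zip(surface(u1, u2 + h), surface(u1, u2 - h))]
    g11 = sum(x * x for x in r1)
    g22 = sum(x * x for x in r2)
    g12 = sum(x * y for x, y in zip(r1, r2))
    return math.sqrt(max(g11 * g22 - g12 * g12, 0.0))


def area(surface, u1_range, u2_range, n=100):
    """Midpoint-rule approximation to the integral of sqrt(g) du1 du2."""
    (a1, b1), (a2, b2) = u1_range, u2_range
    h1, h2 = (b1 - a1) / n, (b2 - a2) / n
    total = 0.0
    for i in range(n):
        for j in range(n):
            total += sqrt_g(surface, a1 + (i + 0.5) * h1, a2 + (j + 0.5) * h2)
    return total * h1 * h2
```

Evaluating `area(sphere, (0, math.pi), (0, 2 * math.pi))` gives a value close to $4\pi$; the same function applies to any other parametrized patch.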


19.3 NORMAL CURVATURE 


The unit normal vector n at a point P on the surface is naturally defined
as the unit vector perpendicular to the tangent plane in such a direction
that the three vectors $r_1\, r_2\, n$ form a right-handed trihedron. In virtue of
19.23, we have $r_1 \times r_2 = \sqrt{g}\, n$, so that


19.31    $n = \frac{r_1 \times r_2}{\sqrt{g}}$,
$[r_1\, r_2\, n] = \sqrt{g}$,
and
19.32    $r_i \times r_j = \varepsilon_{ij} \sqrt{g}\, n$  ($\varepsilon_{ij} = j - i$).


Identifying this trihedron $r_1\, r_2\, n$ with the $r_1\, r_2\, r_3$ of § 18.1, we see from 18.12
that
$r^1 = \frac{r_2 \times n}{\sqrt{g}}$,  $r^2 = \frac{n \times r_1}{\sqrt{g}}$,  $r^3 = n$.


Thus
19.33    $n \times r_i = \sum \varepsilon_{ij} \sqrt{g}\, r^j$.

The tangent plane at P contains a flat pencil of tangents
$t = \sum a^i r_i$


each of which determines a normal plane tn. The section of the surface by
such a plane is called a normal section; it is a plane curve whose curvature
at P is called the normal curvature at P in the direction t. Differentiating
with respect to the arc s of the normal section, we obtain

$\dot{t} = \frac{d}{ds} \sum a^i r_i = \sum \frac{da^i}{ds}\, r_i + \sum a^i\, \frac{dr_i}{ds}$.

By 17.33, this is $\kappa n$. Since n is perpendicular to $r_i$, inner multiplication by n
will eliminate the first sum on the right. (To differentiate $r_i$ we use the
operator

$\frac{d}{ds} = \sum \frac{du^j}{ds}\, \frac{\partial}{\partial u^j} = \sum a^j\, \frac{\partial}{\partial u^j}$,

where $a^j = du^j/ds$.) Thus we are left with


$\kappa = \dot{t} \cdot n = \sum a^i\, \frac{dr_i}{ds} \cdot n = \sum a^i \left( \sum a^j\, \frac{\partial r_i}{\partial u^j} \right) \cdot n = \sum\sum a^i a^j\, r_{ij} \cdot n$,

where $r_{ij}$ denotes $\partial^2 r/\partial u^i\,\partial u^j$.
Introducing the important notation
19.34    $b_{ij} = r_{ij} \cdot n$  (i, j = 1, 2)


we now have the simple formula
19.35    $\kappa = \sum\sum b_{ij}\, a^i a^j$
for the normal curvature in the direction $\sum a^i r_i$. Since $r_{ij}$ is a second derivative,

$b_{12} = b_{21}$.

The three functions $b_{11}$, $b_{12}$, $b_{22}$ are known as fundamental magnitudes of the
second order. Like those of the first order, they occur as coefficients in a
quadratic differential form:
19.36    $\kappa\,ds^2 = \sum\sum b_{ij}\,du^i\,du^j$
$= b_{11}(du^1)^2 + 2b_{12}\,du^1\,du^2 + b_{22}(du^2)^2$.

(It must be remembered that the normal curvature $\kappa$ depends on the direction
of the tangent, and therefore on $du^1 : du^2$.)
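
The formulas 19.34 and 19.36 can be tried out on a simple saddle. The sketch below takes the hyperbolic paraboloid in Monge's form $z = xy$ as an illustrative choice (it is not an example worked in the text) and estimates all derivatives by central differences with an arbitrarily chosen step `h`. The function names are hypothetical.

```python
import math


def saddle(u1, u2):
    """Monge's form z = F(x, y) with the illustrative choice F = x*y."""
    return (u1, u2, u1 * u2)


def _partial(f, u, i, h):
    """Central difference of the vector function f with respect to u^i."""
    up, down = list(u), list(u)
    up[i] += h
    down[i] -= h
    return tuple((p - q) / (2 * h) for p, q in zip(f(*up), f(*down)))


def _dot(p, q):
    return sum(x * y for x, y in zip(p, q))


def fundamental_magnitudes(surface, u, h=1e-4):
    """Return (g, b): the magnitudes g_ij and b_ij at the point u."""
    r = [_partial(surface, u, i, h) for i in range(2)]
    cross = (r[0][1] * r[1][2] - r[0][2] * r[1][1],
             r[0][2] * r[1][0] - r[0][0] * r[1][2],
             r[0][0] * r[1][1] - r[0][1] * r[1][0])
    norm = math.sqrt(_dot(cross, cross))
    n = tuple(x / norm for x in cross)              # unit normal, 19.31

    def r_i(i):
        # r_i as a function of the parameters, so that it can be
        # differentiated once more to give r_ij.
        return lambda p, q: _partial(surface, (p, q), i, h)

    g = [[_dot(r[i], r[j]) for j in range(2)] for i in range(2)]
    b = [[_dot(_partial(r_i(i), u, j, h), n) for j in range(2)]
         for i in range(2)]                          # 19.34
    return g, b


def normal_curvature(g, b, du):
    """kappa = (sum b_ij du^i du^j) / (sum g_ij du^i du^j), by 19.36 and 19.16."""
    num = sum(b[i][j] * du[i] * du[j] for i in range(2) for j in range(2))
    den = sum(g[i][j] * du[i] * du[j] for i in range(2) for j in range(2))
    return num / den
```

At the origin of this saddle the normal curvature comes out as 1 in the direction $du^1 : du^2 = 1 : 1$, as −1 in the direction $1 : -1$, and as 0 along either parametric curve, so it changes sign as the normal plane rotates.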

Differentiating the identity $r_i \cdot n = 0$, we obtain

$r_{ij} \cdot n + r_i \cdot n_j = 0$,

whence  $b_{ij} = -r_i \cdot n_j = -n_i \cdot r_j$.


Along with the “covariant tensor” $b_{ij}$, we shall sometimes find it convenient
to consider the “mixed tensor”
19.37    $b_i^k = \sum g^{jk}\, b_{ij} = -\sum g^{jk}\, r_j \cdot n_i = -r^k \cdot n_i$
and the “contravariant tensor”
19.371    $b^{ik} = \sum g^{ij}\, b_j^k = \sum\sum g^{ij} g^{kl}\, b_{jl}$

$= \sum\sum g^{kl} g^{ij}\, b_{lj} = \sum g^{kl}\, b_l^i = b^{ki}$.


The derivative $n_i$, being perpendicular to the normal n, is a tangent vector,
capable of being expressed as a linear combination of the basic vectors
$r^j$ or $r_j$. Since its covariant and contravariant components are


$n_i \cdot r_j = -b_{ij}$,  $n_i \cdot r^j = -b_i^j$,
the expressions are
19.38    $n_i = -\sum b_{ij}\, r^j = -\sum b_i^j\, r_j$.
We have thus established the “Weingarten equations”
$n_1 = -b_1^1\, r_1 - b_1^2\, r_2$,
$n_2 = -b_2^1\, r_1 - b_2^2\, r_2$,

which express the derivatives of the normal n in terms of the derivatives of 
the position vector r. 


EXERCISES 


1. Evaluate $r^1 \times r^2$.


2. Work out $b_{ij}$ for the unit sphere in terms of colatitude and longitude. Verify that
the normal curvature is the same in all directions and the same at all points on the
sphere. (Hint: Since n = r, $b_{ij} = -g_{ij}$.)


19.4 PRINCIPAL CURVATURES 


Take a unit sphere and draw the radius parallel to the normal at a
point P of [a given] surface. The radius meets the sphere in the spherical
representation of P. Clearly we must distinguish between the two
sides of the surface, and draw the normal on the selected side. By this
representation, to a curve on the surface corresponds, in general, a
curve on the sphere, and to a piece, a piece. But as normals to the surface
at different points may be parallel, pieces on the sphere might
overlap even when they correspond to non-overlapping pieces on the
surface. But if we take pieces on the surface, not too large, this will not
occur. ... To a small piece round P on the surface will correspond a
small piece on the sphere, and the ratio of the area of the latter to the
area of the former, as these areas shrink to zero, tends to the total curvature
at P.

H. G. Forder [3, pp. 139-140] 


Consider a variable plane through the normal at a point P on a given
surface. The section of the surface by this plane has, at P, the normal curvature
$\kappa$ given by 19.35. In exceptional cases (e.g., at the north and south poles of
a spheroid) it may happen that $\kappa$ remains constant; such a point P is called
an umbilic. If P is not an umbilic, a continuous rotation of the normal plane
nt makes $\kappa$ vary in such a way as to return to its original value as soon as a
half-turn has been completed. With the help of 19.26 we may express 19.35
in the homogeneous form


$\kappa \sum\sum g_{ij}\, a^i a^j = \sum\sum b_{ij}\, a^i a^j$


or
19.41    $\sum\sum (b_{ij} - \kappa g_{ij})\, a^i a^j = 0$.
This exhibits $\kappa$ as a continuous function of the ratio
$\frac{a^2}{a^1} = \frac{du^2}{du^1}$,


which determines the direction of the tangent
$t = \sum a^i r_i$.


In the course of its continuous variation, the normal curvature $\kappa$ must attain
at least one maximum and at least one minimum. We proceed to prove that
there is just one of each, and that they occur in perpendicular directions.
The maximum and minimum values of $\kappa$ are called the principal curvatures,
the positions of t in which they occur are called the principal directions, and
the curves whose direction is always principal are called (perhaps not too
happily) the lines of curvature.
As a temporary abbreviation, we write


$c_{ij} = b_{ij} - \kappa g_{ij}$,
so that $c_{ij} = c_{ji}$. To find the principal curvatures and principal directions, we
may differentiate 19.41 and then set $d\kappa = 0$ or, more conveniently, differentiate
19.41 regarding $\kappa$ as a constant. Since $b_{ij}$ and $g_{ij}$ depend only on
the fixed point P, this means that we differentiate


$\sum\sum c_{ij}\, a^i a^j = 0$


treating the coefficients $c_{ij}$ as constants. Differentiating partially with respect
to $a^k$, we find
$\frac{\partial}{\partial a^k} \sum\sum c_{ij}\, a^i a^j = \sum\sum c_{ij} \left( \frac{\partial a^i}{\partial a^k}\, a^j + a^i\, \frac{\partial a^j}{\partial a^k} \right)$

$= \sum\sum c_{ij} (\delta^i_k\, a^j + a^i\, \delta^j_k) = \sum c_{kj}\, a^j + \sum c_{ik}\, a^i$

$= \sum (c_{ki} + c_{ik})\, a^i = 2 \sum c_{ik}\, a^i$.
Restoring the proper expression for $c_{ik}$, we deduce
19.42    $\sum (b_{ik} - \kappa g_{ik})\, a^i = 0$  (k = 1, 2).

Multiplying by $g^{jk}$ (and summing over k) to eliminate the coefficient $g_{ik}$, we
obtain


19.43 oS (bi = Kdi) a = 0, 
that is, 
19.44 = biai — kai = 0 G = 1, 2). 


From these two equations we can find the principal curvatures by eliminat- 
ing a?/a1, and the principal directions by eliminating «. 
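When written out in detail, the equations are:


$(b_1^1 - \kappa)\, a^1 + b_2^1\, a^2 = 0,$
$b_1^2\, a^1 + (b_2^2 - \kappa)\, a^2 = 0.$


Eliminating $a^2/a^1$, we obtain


$\begin{vmatrix} b_1^1 - \kappa & b_2^1 \\ b_1^2 & b_2^2 - \kappa \end{vmatrix} = 0,$
that is,
19.45    $\kappa^2 - \sum b_i^i\, \kappa + \det (b_i^j) = 0.$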


This quadratic equation has for its two roots the principal curvatures $\kappa_{(1)}$,
$\kappa_{(2)}$, whose product and arithmetic mean are known as the Gaussian curvature
K and the mean curvature H. Thus $\kappa_{(1)}$ and $\kappa_{(2)}$ are the roots of the
equation


$\kappa^2 - 2H\kappa + K = 0,$


where
19.46    $2H = \kappa_{(1)} + \kappa_{(2)} = \sum b_i^i = b_1^1 + b_2^2$
and
19.47    $K = \kappa_{(1)} \kappa_{(2)} = \det (b_i^j) = b_1^1 b_2^2 - b_1^2 b_2^1.$
Since
$gK = \det (g_{ij}) \det (b_i^j) = \det (b_{ij}) = b,$


say, another expression for K is the ratio of the two fundamental determinants:


19.471    $K = \dfrac{b}{g}.$
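The chain from 19.37 to 19.471 can be followed numerically. The sketch below is only an illustration, not part of the text: the function name `principal_curvatures` and the nested-list layout for $g_{ij}$ and $b_{ij}$ are assumptions made here. It forms the mixed magnitudes $b_i^j$, reads off 2H and K as in 19.46 and 19.47, and solves the quadratic $\kappa^2 - 2H\kappa + K = 0$.

```python
import math


def principal_curvatures(g, b):
    """Return (k1, k2, H, K) from the fundamental magnitudes at one point.

    g and b are 2x2 nested lists holding g_ij and b_ij (hypothetical layout).
    """
    (g11, g12), (g21, g22) = g
    det_g = g11 * g22 - g12 * g21
    # Contravariant magnitudes g^{jk}: the inverse of the matrix g_ij.
    ginv = [[g22 / det_g, -g12 / det_g],
            [-g21 / det_g, g11 / det_g]]
    # Mixed magnitudes b_i^j = sum_k g^{jk} b_ik, stored as mixed[i][j].
    mixed = [[sum(ginv[j][k] * b[i][k] for k in range(2)) for j in range(2)]
             for i in range(2)]
    two_h = mixed[0][0] + mixed[1][1]                                 # 19.46
    gauss_k = mixed[0][0] * mixed[1][1] - mixed[0][1] * mixed[1][0]   # 19.47
    # The discriminant equals (k1 - k2)^2; clamp it against rounding error.
    root = math.sqrt(max(two_h * two_h - 4.0 * gauss_k, 0.0))
    return (two_h + root) / 2, (two_h - root) / 2, two_h / 2, gauss_k


# Outer equator of a torus with radii 2 and 1: g = diag(1, 9), b = diag(1, 3).
k1, k2, mean_h, gauss_k = principal_curvatures([[1.0, 0.0], [0.0, 9.0]],
                                               [[1.0, 0.0], [0.0, 3.0]])
```

In this example the value of K agrees with the ratio b/g of 19.471, namely 3/9.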
When K is positive, the normal curvature (never going outside the range
from $\kappa_{(1)}$ to $\kappa_{(2)}$) has the same sign in all directions; the tangent plane at P
meets the surface “instantaneously” at P and not anywhere else in the
neighborhood of P. The surface is then said to be synclastic (or “oval”).
Ellipsoids, elliptic paraboloids, and hyperboloids of two sheets are everywhere
synclastic.


When K is negative, the normal curvature changes sign twice (during the 
rotation of the normal plane through a half-turn about the normal at P); 
therefore it is zero in the directions of two special tangents, called the in- 
flectional tangents at P. The surface crosses its tangent plane, and its sec- 
tion by this plane is a pair of curves that cross each other at P, the two 
tangents at this “node” being the inflectional tangents. 

A practical instance is the general shape of the ground at the top of a 
mountain pass. The tangent plane is the horizontal plane, which touches 
the curve of the footpath and cuts into the ground on both sides. The fact 
that the tangent section has a node is seen in a map on which contour lines 
are marked. The mountain pass occurs where one of the contour lines 
crosses itself [Hardy 1, p. 65]. 

Such a surface is said to be anticlastic (or “saddle-shaped”). Nondegen- 
erate ruled quadrics (namely, hyperbolic paraboloids and hyperboloids of 
one sheet) are everywhere anticlastic. 

Surfaces more complicated than quadrics may be synclastic in some re- 
gions and anticlastic in others. Regions of the two kinds are then separated 
by a locus of parabolic points, at which K = 0. Hilbert and Cohn-Vossen [1, 
p. 197, Fig. 204] show a bust of Apollo on which the curves of parabolic 
points have been drawn. They are quite complicated, especially round the 
nose and mouth. 

Surfaces on which K = 0 everywhere are said to be developable. Such sur- 
faces include cones and cylinders, and also the surface traced out by the tan- 
gents of any twisted curve. 


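The Weingarten equations 19.38 provide a useful expression for the Gaussian
curvature as a triple product:


19.48    $K = [\mathbf{n}\, \mathbf{n}_1\, \mathbf{n}_2] / \sqrt{g}.$
In fact,
$[\mathbf{n}\, \mathbf{n}_1\, \mathbf{n}_2] = [\mathbf{n},\ \sum b_1^j \mathbf{r}_j,\ \sum b_2^k \mathbf{r}_k] = \sum\sum b_1^j b_2^k\, [\mathbf{n}\, \mathbf{r}_j\, \mathbf{r}_k]$
$= \sum\sum b_1^j b_2^k\, \epsilon_{jk} \sqrt{g} = \det (b_i^j) \sqrt{g}$
$= K \sqrt{g}.$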


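Another expression, involving an arbitrary unit tangent vector t, was discovered
by A. J. Coleman:


19.49    $\sqrt{g}\, K = \sum\sum \epsilon^{ij}\, [\mathbf{n}\, \mathbf{t}\, \mathbf{t}_i]_j,$


where the final subscript indicates differentiation with respect to $u^j$. This
is deduced from Lagrange’s identity 17.13 by introducing another unit tangent
vector $\mathbf{m} = \mathbf{n} \times \mathbf{t}$, so that $\mathbf{n} = \mathbf{t} \times \mathbf{m}$ and


$[\mathbf{n}\, \mathbf{n}_1\, \mathbf{n}_2] = (\mathbf{t} \times \mathbf{m}) \cdot (\mathbf{n}_1 \times \mathbf{n}_2)$
$= \mathbf{t} \cdot \mathbf{n}_1\ \mathbf{m} \cdot \mathbf{n}_2 - \mathbf{t} \cdot \mathbf{n}_2\ \mathbf{m} \cdot \mathbf{n}_1$
$= \sum\sum \epsilon^{ij}\ \mathbf{t} \cdot \mathbf{n}_i\ \mathbf{m} \cdot \mathbf{n}_j.$


Differentiating $\mathbf{t} \cdot \mathbf{n} = 0$, $\mathbf{m} \cdot \mathbf{n} = 0$, and using 17.19, we see that


$\mathbf{t} \cdot \mathbf{n}_i\ \mathbf{m} \cdot \mathbf{n}_j = \mathbf{t}_i \cdot \mathbf{n}\ \mathbf{m}_j \cdot \mathbf{n} = \mathbf{t}_i \cdot \mathbf{n}\ \mathbf{n} \cdot \mathbf{m}_j$
$= \mathbf{t}_i \cdot \mathbf{m}_j = (\mathbf{t}_i \cdot \mathbf{m})_j - \mathbf{t}_{ij} \cdot \mathbf{m}.$


Since $\sum\sum \epsilon^{ij}\, \mathbf{t}_{ij} = 0$, it follows that


$\sqrt{g}\, K = \sum\sum \epsilon^{ij} (\mathbf{t}_i \cdot \mathbf{m})_j = \sum\sum \epsilon^{ij} (\mathbf{m} \cdot \mathbf{t}_i)_j$
$= \sum\sum \epsilon^{ij}\, [\mathbf{n}\, \mathbf{t}\, \mathbf{t}_i]_j.$


(We have interchanged the t and m of Kreyszig [1, p. 146].)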


Since $\mathbf{n}_1 \times \mathbf{n}_2$ is parallel to n, 19.48 may be expressed in the form
$|\mathbf{n}_1 \times \mathbf{n}_2| = |K| \sqrt{g},$
which can be used to establish Gauss’s geometrical interpretation for K. To
obtain his spherical representation of a surface, Gauss considered the locus
of the end Q of a vector


$\overrightarrow{OQ} = \mathbf{n},$
where O is a fixed point and n is the unit normal at a point P which varies
on the given surface [Hilbert and Cohn-Vossen 1, pp. 193-196]. When P
travels over a sufficiently small region F, bounded by a simple closed curve
on the surface, Q travels over a corresponding region G of the unit sphere
with center O. Gauss defined the total curvature of the surface at P to be
the limit of the ratio of the areas of G and F when these regions are shrunk
to single points. By 19.24, the area of F is


$\iint |\mathbf{r}_1\, du^1 \times \mathbf{r}_2\, du^2| = \iint \sqrt{g}\ du^1\, du^2.$
Analogously, the area of G is
$\iint |\mathbf{n}_1\, du^1 \times \mathbf{n}_2\, du^2| = \iint |K| \sqrt{g}\ du^1\, du^2.$


Hence the limit of the ratio is |K|.

The characteristic property of a developable surface is that, instead of a 
two-parameter family of tangent planes, it only has a one-parameter family 
of tangent planes, and so also a one-parameter family of normals. In this 
case G is not a proper region but merely an arc, and therefore K = 0. 


If the parametric curves are orthogonal, so that $g_{12} = 0$, we have
$g^{11} = 1/g_{11}$, $g^{12} = 0$, $g^{22} = 1/g_{22}$, whence, by 19.37,


$b_i^j = \sum g^{jk} b_{ik} = g^{jj} b_{ij} = b_{ij}/g_{jj}.$
In this case the mean curvature H is given by
$2H = \dfrac{b_{11}}{g_{11}} + \dfrac{b_{22}}{g_{22}}.$
EXERCISES 


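1. Find the mean curvature H at a given point on the helicoid y = x tan (z/c),
parametrized in the form


$x = u^1 \cos u^2, \quad y = u^1 \sin u^2, \quad z = c u^2.$
2. Find the mean curvature H at a given point on the general surface of revolution
$\mathbf{r} = (u^1 \cos u^2,\ u^1 \sin u^2,\ z),$
where z is a function of $u^1$ alone. This mean curvature is zero when z is given by
$u^1 = a \cosh \{(z - c)/a\},$
i.e., when the surface is a catenoid.
3. Locate the curves of parabolic points on the torus


$x = (a + b \cos u^1) \cos u^2, \quad y = (a + b \cos u^1) \sin u^2, \quad z = b \sin u^1.$
4. The tangents $\mathbf{t} = \dot{\mathbf{r}}$ of a twisted curve $\mathbf{r} = \mathbf{r}(s)$ generate a surface
$\mathbf{r}(s, u) = \mathbf{r}(s) + u\, \mathbf{t}(s).$
Using s and u as parameters, obtain the fundamental magnitudes
$b_{11} = \kappa \tau u, \quad b_{12} = b_{22} = 0.$
Deduce that K = 0 everywhere.
5. The mean curvature and Gauss curvature are connected by the inequality


$H^2 \ge K.$
At what kind of point do we find $H^2 = K$? (Hint: $H^2 - K = \frac{1}{4} (\kappa_{(1)} - \kappa_{(2)})^2$.)


6. Derive 19.48 another way, by applying Lagrange’s identity to $(\mathbf{r}_1 \times \mathbf{r}_2) \cdot (\mathbf{n}_1 \times \mathbf{n}_2)$.
7. Derive 19.49 another way, by applying 17.62 in the form


$\mathbf{t}_i = \nu_i \mathbf{m} - \mu_i \mathbf{n}, \quad \mathbf{m}_i = \lambda_i \mathbf{n} - \nu_i \mathbf{t}, \quad \mathbf{n}_i = \mu_i \mathbf{t} - \lambda_i \mathbf{m}$
(where $\lambda_i$, $\mu_i$, $\nu_i$ are functions of $u^1$ and $u^2$), so that


$[\mathbf{n}\, \mathbf{n}_1\, \mathbf{n}_2] = \lambda_1 \mu_2 - \lambda_2 \mu_1 = \mathbf{m}_2 \cdot \mathbf{t}_1 - \mathbf{m}_1 \cdot \mathbf{t}_2.$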


19.5 PRINCIPAL DIRECTIONS AND LINES OF CURVATURE 


Returning to 19.44, which may be expressed as
$\sum b_i^k\, a^i = \kappa a^k \quad (k = 1, 2),$


we find that the easiest way to eliminate $\kappa$ is to multiply by $\sum \epsilon_{jk}\, a^j$ and sum
over k, obtaining
$\sum\sum\sum \epsilon_{jk}\, b_i^k\, a^i a^j = \kappa \sum\sum \epsilon_{jk}\, a^j a^k = 0.$


(This sum is zero because the only nonvanishing terms involve $\epsilon_{12}\, a^1 a^2$ and
$\epsilon_{21}\, a^2 a^1$, which cancel.) Thus the principal directions $\sum a^i \mathbf{r}_i$ are given by
the roots of the quadratic equation


$\sum\sum\sum \epsilon_{jk}\, b_i^k\, a^i a^j = 0$


for $a^1 : a^2$. In other words, the lines of curvature are determined by the
differential equation


19.51    $\sum\sum\sum \epsilon_{jk}\, b_i^k\, du^i\, du^j = 0$
or
19.52    $-b_1^2 (du^1)^2 + (b_1^1 - b_2^2)\, du^1\, du^2 + b_2^1 (du^2)^2 = 0.$


To prove that the lines of curvature form an orthogonal net, we may apply
the criterion 19.291 either to 19.51 or to 19.52, using


$c_{ij} = \sum \epsilon_{jk}\, b_i^k.$
We obtain, in the notation of 19.371,
$\sum\sum g^{ij} c_{ij} = \sum\sum\sum \epsilon_{jk}\, g^{ij} b_i^k = \sum\sum \epsilon_{jk}\, b^{jk} = 0.$


It follows that, at any point P on a surface, the two principal directions are
perpendicular.
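The perpendicularity can also be checked numerically at a single point. The following sketch is an illustration under assumptions made here (the names `principal_directions` and `metric_dot`, and the nested-list inputs, are hypothetical): it solves the quadratic 19.52 for the two direction ratios $du^1 : du^2$ and evaluates the inner product $\sum\sum g_{ij}\, a^i b^j$ of the two resulting tangent directions.

```python
import math


def principal_directions(g, b, tol=1e-12):
    """Solve 19.52 for the two principal directions (du1, du2) at one point.

    g and b hold g_ij and b_ij as symmetric 2x2 nested lists. Returns None
    at an umbilic, where 19.52 vanishes identically.
    """
    (g11, g12), (_, g22) = g
    (b11, b12), (_, b22) = b
    det_g = g11 * g22 - g12 * g12
    # Mixed magnitudes b_i^j = sum_k g^{jk} b_ik.
    m11 = (g22 * b11 - g12 * b12) / det_g    # b_1^1
    m12 = (-g12 * b11 + g11 * b12) / det_g   # b_1^2
    m21 = (g22 * b12 - g12 * b22) / det_g    # b_2^1
    m22 = (-g12 * b12 + g11 * b22) / det_g   # b_2^2
    # 19.52 written as A (du1)^2 + B du1 du2 + C (du2)^2 = 0.
    A, B, C = -m12, m11 - m22, m21
    if max(abs(A), abs(B), abs(C)) < tol:
        return None
    disc = max(B * B - 4.0 * A * C, 0.0)
    # Numerically stable form of the quadratic formula: the two roots of
    # du1/du2 are q/A and C/q, returned as direction pairs.
    q = -(B + math.copysign(math.sqrt(disc), B)) / 2.0
    return (q, A), (C, q)


def metric_dot(g, v, w):
    """Inner product of two tangent vectors given by contravariant components."""
    return (g[0][0] * v[0] * w[0] + g[0][1] * (v[0] * w[1] + v[1] * w[0])
            + g[1][1] * v[1] * w[1])


g_example = [[2.0, 1.0], [1.0, 3.0]]
b_example = [[1.0, 0.5], [0.5, -1.0]]
d1, d2 = principal_directions(g_example, b_example)
inner = metric_dot(g_example, d1, d2)   # vanishes up to rounding
```

The sample magnitudes are arbitrary numbers chosen so that the parametric curves are not orthogonal; they do not come from a particular surface in the text.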
Another consequence of 19.44 is Rodrigues’s formula
19.53    $d\mathbf{n} + \kappa\, d\mathbf{r} = 0,$


which shows what happens to the normal n when r is displaced in a principal
direction. (The coefficient $\kappa$ is the corresponding principal curvature.)
In fact, by combining the Weingarten equations


$\mathbf{n}_i = -\sum b_i^j\, \mathbf{r}_j$
(19.38) with 19.44 in the form
19.54    $\sum b_i^j\, du^i = \kappa\, du^j,$
we obtain


$d\mathbf{n} = \sum \mathbf{n}_i\, du^i = -\sum\sum b_i^j\, \mathbf{r}_j\, du^i = -\kappa \sum \mathbf{r}_j\, du^j = -\kappa\, d\mathbf{r}.$


(Olinde Rodrigues, 1794-1851.)

It follows that, if dr is in a principal direction, dn is in the same (or the
opposite) direction. Moreover, the principal directions are the only directions
in which this happens. For, if dn is parallel to dr, the analysis given above
shows that, for some number $\lambda$,


$\sum\sum b_i^j\, \mathbf{r}_j\, du^i = \lambda \sum \mathbf{r}_j\, du^j.$
Writing this in the form
$\sum (\sum b_i^j\, du^i - \lambda\, du^j)\, \mathbf{r}_j = 0,$
we see that it implies
$\sum b_i^j\, du^i - \lambda\, du^j = 0 \quad (j = 1, 2).$


Eliminating $\lambda$, the way we eliminated $\kappa$ before, we again obtain 19.51, which
is the differential equation for the lines of curvature.


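If the parametric curves are orthogonal, so that $b_i^j = b_{ij}/g_{jj}$, the equation
19.52 becomes


19.55    $-\dfrac{b_{12}}{g_{22}} (du^1)^2 + \left( \dfrac{b_{11}}{g_{11}} - \dfrac{b_{22}}{g_{22}} \right) du^1\, du^2 + \dfrac{b_{12}}{g_{11}} (du^2)^2 = 0.$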


For the investigation of a given surface, any net of curves on the surface
may be used as parametric curves. The lines of curvature provide a standard
net which is always available. Comparing 19.52 with $du^1\, du^2 = 0$, we
see that


19.56    $b_1^2 = b_2^1 = 0.$


Hence


The parametric curves are the lines of curvature if and only if $b_1^2$ and $b_2^1$ are
identically zero.
In this case the equation 19.45 reduces to


$(\kappa - b_1^1)(\kappa - b_2^2) = 0,$


so the two principal curvatures are $b_1^1$ and $b_2^2$. To see which is which, we
apply Rodrigues’s formula 19.53 to displacements along the parametric
curves. The “first” principal direction is naturally the one along which $u^1$
varies while $u^2$ remains constant, that is, the direction of $\mathbf{r}_1$; and the “second”
is the direction of $\mathbf{r}_2$. Thus


19.57    $\mathbf{n}_i + \kappa_{(i)} \mathbf{r}_i = 0.$
Taking inner products with $\mathbf{r}^j$ and $\mathbf{r}_j$, we deduce

$-b_i^j + \kappa_{(i)} \delta_i^j = 0, \quad -b_{ij} + \kappa_{(i)} g_{ij} = 0.$
Hence the two principal curvatures are precisely


19.58    $\kappa_{(1)} = b_1^1 = \dfrac{b_{11}}{g_{11}}, \quad \kappa_{(2)} = b_2^2 = \dfrac{b_{22}}{g_{22}},$


and we see also that


$b_{12} = \kappa_{(1)} g_{12} = 0$


(since the lines of curvature are orthogonal).
Conversely, if the parameters on any surface are so chosen that


19.59    $g_{12} = 0, \quad b_{12} = 0,$
then the parametric curves are the lines of curvature; for, since
$g^{12} = -\dfrac{g_{12}}{g} = 0,$


we have
$b_1^2 = \sum g^{2j} b_{1j} = g^{22} b_{12} = 0$
and likewise $b_2^1 = 0$. In fact, the conditions 19.59 are equivalent to 19.56.


EXERCISES 


1. Apply 19.291 to 19.52, so as to prove the orthogonality of the principal directions
without using the symbol $\epsilon_{jk}$.
2. The differential equation for the lines of curvature may be expressed in the form


$\begin{vmatrix} g_{11} & g_{12} & g_{22} \\ b_{11} & b_{12} & b_{22} \\ (du^2)^2 & -du^1\, du^2 & (du^1)^2 \end{vmatrix} = 0.$
3. Find the lines of curvature on the hyperbolic paraboloid $x^2 - y^2 = 2z$,
parametrized in the form


$x = \sinh u^1 + \sinh u^2, \quad y = \sinh u^1 - \sinh u^2, \quad z = 2 \sinh u^1 \sinh u^2.$
4. Find the lines of curvature on the helicoid y = x tan (z/c), parametrized in the
form


$x = u^1 \cos u^2, \quad y = u^1 \sin u^2, \quad z = c u^2.$


5. The equations 19.56 or 19.59, holding at a particular point (but not necessarily 
identically), are conditions for the parametric directions to coincide with the principal 
directions at the point considered. The formulas 19.58 still hold at this point. 


19.6 UMBILICS 


An umbilic is a point at which the normal curvature $\kappa$ is the same in all
directions. At such a point, the equations 19.42, 19.43 are satisfied for all
values of $a^1 : a^2$, and therefore


$b_{ik} = \kappa g_{ik}, \quad b_i^j = \kappa \delta_i^j.$

In fact, we have two alternative sets of conditions for an umbilic: one set is
$b_{11} : b_{12} : b_{22} = g_{11} : g_{12} : g_{22},$

and the other,

19.61    $b_1^2 = b_2^1 = 0, \quad b_1^1 = b_2^2.$


If a surface is symmetrical by reflection in a plane, its section by the plane
is a line of curvature. To see why this happens, consider a point P on the
section. If the principal directions at P were oblique to the plane, either of
them would reflect into another principal direction associated with the same
principal curvature. Such an abundance of principal directions would make
P an umbilic. But a curve consisting entirely of umbilics is, trivially, a line of
curvature.

In particular, on any surface of revolution, the meridians and parallels are 
lines of curvature. An interesting special case arises when we rotate a plane 
curve about one of the normals of its evolute; that is, if C is the center of 
curvature for a point P on the plane curve, we rotate about the line through 
C parallel to the tangent at P. In this case the locus of P is an “equator” 
whose radius is equal to the radius of curvature of the meridian. The equa- 
tor, like the meridian, is both a line of curvature and a normal section. Since 
the two principal curvatures are equal, every point on the equator is an 
umbilic. 

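At an umbilic, Rodrigues’s formula 19.53 holds for all displacements on
the surface. In particular, we can apply it to displacements along the
parametric curves, obtaining


19.62    $\mathbf{n}_i + \kappa \mathbf{r}_i = 0 \quad (i = 1, 2).$


Hence


19.63    If every point is an umbilic, the surface is either a plane or a sphere.
For, in this case 19.62 holds everywhere. Differentiating, we deduce


$\mathbf{n}_{ij} + \kappa \mathbf{r}_{ij} + \kappa_j \mathbf{r}_i = 0, \quad \kappa_1 \mathbf{r}_2 = \kappa_2 \mathbf{r}_1,$


and $\kappa_1 = \kappa_2 = 0$ (since $\mathbf{r}_1$ and $\mathbf{r}_2$ are linearly independent). Thus $\kappa$ is
constant, and 19.62 yields $(\mathbf{n} + \kappa \mathbf{r})_i = 0$, so that $\mathbf{n} + \kappa \mathbf{r}$ is constant.
If $\kappa = 0$, n is constant, and we have a plane. If $\kappa \ne 0$, a
suitable origin makes $\mathbf{r} = -\kappa^{-1} \mathbf{n}$, $|\mathbf{r}| = |\kappa|^{-1}$, and we have a sphere.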


EXERCISES 
1. How can the equation 19.52 be used to derive the conditions
$b_1^2 = b_2^1 = 0, \quad b_1^1 = b_2^2$
for an umbilic?
2. What happens to the equation 19.45 when these conditions are satisfied?


3. Anticlastic surfaces have no umbilics. 
4. The surface 


X= V2cosul, y=V/2cosu?, z= sinulsin wu? 
has a curve of umbilics lying on the sphere x2 4 y2 4 22 = 4. 
5. Does every umbilic lie on infinitely many lines of curvature?


19.7 DUPIN’S THEOREM AND LIOUVILLE’S THEOREM


Dupin investigated triply orthogonal families of surfaces, not as a bar- 
ren exercise in the differential calculus but because certain instances of 
such families are of the first importance in... mathematical physics. 
[They] were the occasion for one of Darboux' more famous works, 
extending to 567 pages. 


E. T. Bell [2, p. 331]


In the exercises at the end of § 18.5, we found several instances of
three-dimensional coordinates having the special property


$g_{23} = g_{31} = g_{12} = 0,$
so that the level surfaces all cut one another at right angles. In such a case
the three systems of surfaces are said to be mutually orthogonal.
Differentiating the equation $\mathbf{r}_1 \cdot \mathbf{r}_2 = 0$ with respect to $u^3$, we obtain


$\mathbf{r}_{13} \cdot \mathbf{r}_2 + \mathbf{r}_1 \cdot \mathbf{r}_{23} = 0.$


From this and two other such equations (derived by cyclic permutation of
123) we deduce


$\mathbf{r}_1 \cdot \mathbf{r}_{23} = \mathbf{r}_2 \cdot \mathbf{r}_{31} = \mathbf{r}_3 \cdot \mathbf{r}_{12} = 0.$
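The deduction can be spelled out as a small linear system. In the sketch below the letters A, B, C are labels introduced only for this illustration (they do not appear in the text); each differentiated orthogonality relation gives the sum of two of them, and the three relations together force all three to vanish.

```latex
\begin{align*}
A &= \mathbf{r}_1 \cdot \mathbf{r}_{23}, \qquad
B = \mathbf{r}_2 \cdot \mathbf{r}_{31}, \qquad
C = \mathbf{r}_3 \cdot \mathbf{r}_{12}, \\
(\mathbf{r}_1 \cdot \mathbf{r}_2)_3 = 0 &\;\Longrightarrow\; B + A = 0, \\
(\mathbf{r}_2 \cdot \mathbf{r}_3)_1 = 0 &\;\Longrightarrow\; C + B = 0, \\
(\mathbf{r}_3 \cdot \mathbf{r}_1)_2 = 0 &\;\Longrightarrow\; A + C = 0, \\
2A = (B + A) - (C + B) + (A + C) = 0 &\;\Longrightarrow\; A = B = C = 0.
\end{align*}
```

Here the symmetry $\mathbf{r}_{ij} = \mathbf{r}_{ji}$ of the second derivatives is used to identify, for instance, $\mathbf{r}_{13} \cdot \mathbf{r}_2$ with B.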


Since $\mathbf{r}_3$ is normal to the surface $u^3 = c$ of the third system, this surface
satisfies not only $g_{12} = 0$ but also, since $\mathbf{r}_3 \cdot \mathbf{r}_{12} = 0$,


$b_{12} = \mathbf{n} \cdot \mathbf{r}_{12} = 0,$


as in 19.59. Hence the parametric curves $u^1 = a$ and $u^2 = b$ on this surface
are lines of curvature. Since a similar result holds for a surface of either
of the other systems, we have now proved


DUPIN’S THEOREM. In three mutually orthogonal systems of surfaces, the
lines of curvature on any surface in one of the systems are its intersections with
the surfaces of the other two systems.


Moreover, any surface may be exhibited as a member of one of three mutually
orthogonal systems. This can actually be done in many ways. One
way is to use a system of parallel surfaces, defined as the loci of points at
constant distances along the normals of the given surface. Using the lines of
curvature as parametric curves, we may express the position vector of a
typical parallel surface in the form


$\bar{\mathbf{r}} = \mathbf{r} + u^3 \mathbf{n}.$


We see from 19.57 that the directions of the parametric curves on the new
surface are given by


$\bar{\mathbf{r}}_i = (\mathbf{r} + u^3 \mathbf{n})_i = \mathbf{r}_i + u^3 \mathbf{n}_i$
$= \mathbf{r}_i - u^3 \kappa_{(i)} \mathbf{r}_i = (1 - \kappa_{(i)} u^3)\, \mathbf{r}_i,$
that is, they are parallel to those on the original surface. Therefore, both
surfaces have the same normal: $\bar{\mathbf{n}} = \mathbf{n}$. Since


$\bar{g}_{12} = \bar{\mathbf{r}}_1 \cdot \bar{\mathbf{r}}_2 = 0$
and
$\bar{b}_{12} = -\bar{\mathbf{n}}_1 \cdot \bar{\mathbf{r}}_2 = -\mathbf{n}_1 \cdot \bar{\mathbf{r}}_2 = \kappa_{(1)}\, \mathbf{r}_1 \cdot \bar{\mathbf{r}}_2 = 0,$


the parametric curves on the new surface are again lines of curvature
[Weatherburn 2, p. 159]. Allowing $u^3$ to take various values, we obtain a
whole system of parallel surfaces. The other two orthogonal systems are
traced out by the normals at points that run along the lines of curvature [La
Vallée Poussin 2, p. 447].


In §6.7 and §7.7 we analysed the general circle-preserving and sphere-preserving
transformations. In §9.7 we indicated how the theory of functions
of a complex variable could be employed to prove that the circle-preserving
transformations are the only angle-preserving transformations of the
whole inversive plane into itself. It is very remarkable that the analogous
theorem in three (or more) dimensions is elementary! We merely have to
observe that, if a transformation of space preserves the angles at which
surfaces cut, it transforms mutually orthogonal systems of surfaces into mutually
orthogonal systems of surfaces. Hence, if it transforms a surface σ into another
surface σ′, it transforms the lines of curvature on σ into the lines of
curvature on σ′. Since a sphere (including a plane as a special case) is
characterized by the property that all directions on it are principal directions, we
can immediately deduce


LIOUVILLE’S THEOREM. Every angle-preserving transformation is a
sphere-preserving transformation.


Taking this along with 7.71, we see that every angle-preserving transfor- 
mation is either a similarity or the product of an inversion and an isometry 
[Forder 3, pp. 137-138]. 

EXERCISES 


1. When the surfaces $u^1 = a$ and $u^2 = b$ are traced out by normals along lines of
curvature, while the surfaces $u^3 = c$ are “parallel,” the fundamental magnitudes for
$u^1 = a$ are naturally denoted by $g_{22}$, $g_{23}$, $g_{33}$, $b_{22}$, $b_{23}$, $b_{33}$. Dupin’s theorem shows that
$g_{23} = b_{23} = 0$. Compute $b_{33}$, and deduce that for this surface K = 0.

2. The central quadrics


$\dfrac{x^2}{A - \lambda} + \dfrac{y^2}{B - \lambda} + \dfrac{z^2}{C - \lambda} = 1$


(which are ellipsoids when $\lambda < C$, hyperboloids of one sheet when $C < \lambda < B$,
hyperboloids of two sheets when $B < \lambda < A$) are said to be confocal. At a point (x, y, z)
on such a quadric, the direction of the normal is


$\left( \dfrac{x}{A - \lambda},\ \dfrac{y}{B - \lambda},\ \dfrac{z}{C - \lambda} \right).$


When x, y, z are given, 18.59 is a cubic equation for $\lambda$, whose roots $u^1$, $u^2$, $u^3$
(numbered in increasing order) satisfy


$u^1 < C < u^2 < B < u^3 < A.$


Deduce that any point for which $xyz \ne 0$ lies on three quadrics of the system (one of
each kind), cutting one another orthogonally [La Vallée Poussin 2, p. 448].

3. Where are the lines of curvature on an ellipsoid? [Hilbert and Cohn-Vossen
1, p. 189.]


19.8 DUPIN’S INDICATRIX 


Although the indicatrix was not invented by Dupin, he made more effec- 
tive use than had his predecessors of this suggestive conic in which a 
plane parallel to, and “infinitesimally near to,” the tangent plane at 
any point of a surface intersects the surface. 


E. T. Bell [2, p. 331]


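When a surface is given in Monge’s form z = F(x, y), it is often convenient
to use the coordinates x, y themselves as parameters, so that z is a
function of x and y with derivatives


$z_1 = \dfrac{\partial z}{\partial x}, \quad z_2 = \dfrac{\partial z}{\partial y}, \quad z_{11} = \dfrac{\partial^2 z}{\partial x^2}, \quad \ldots.$
Differentiating $\mathbf{r} = (x, y, z)$, we obtain
$\mathbf{r}_1 = (1, 0, z_1), \quad \mathbf{r}_2 = (0, 1, z_2), \quad \mathbf{r}_{ij} = (0, 0, z_{ij}),$
whence
$g_{11} = 1 + z_1^2, \quad g_{12} = z_1 z_2, \quad g_{22} = 1 + z_2^2,$
$g = g_{11} g_{22} - g_{12}^2 = 1 + z_1^2 + z_2^2,$
$\sqrt{g}\, \mathbf{n} = \mathbf{r}_1 \times \mathbf{r}_2 = (-z_1, -z_2, 1),$
$\sqrt{g}\, b_{ij} = \sqrt{g}\, \mathbf{n} \cdot \mathbf{r}_{ij} = z_{ij}.$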

If the coordinate axes are so chosen that the point under consideration
is the origin and the normal there is the z-axis, we have


$\mathbf{n} = (0, 0, 1),$
whence $z_1 = z_2 = 0$, $g_{11} = 1$, $g_{12} = 0$, $g_{22} = 1$, $g = 1$, $b_{ij} = z_{ij}$.
Since z is a function of x and y, we can expand it in a Maclaurin series
$z = z(0, 0) + z_1 x + z_2 y + \frac{1}{2} (z_{11} x^2 + 2 z_{12} xy + z_{22} y^2) + \frac{1}{6} (z_{111} x^3 + \ldots)$
$= \frac{1}{2} (b_{11} x^2 + 2 b_{12} xy + b_{22} y^2)$ + (terms of higher degree in x and y).
The “terms of higher degree” become important if $b_{11} = b_{12} = b_{22} = 0$, in
which case the origin is a parabolic umbilic. In all other cases the section
by a plane z = e, parallel to the tangent plane z = 0 at a small distance |e|,
resembles the conic

$b_{11} x^2 + 2 b_{12} xy + b_{22} y^2 = 2e,$


which is similar to Dupin’s indicatrix
19.81    $b_{11} x^2 + 2 b_{12} xy + b_{22} y^2 = \pm 1.$


We shall find that this conic (or pair of conics) indicates, in a remarkably
simple manner, the normal curvature in every direction. We first observe
that this part of the surface is synclastic, anticlastic, or parabolic, according
as

$b > 0, \quad b < 0, \quad \text{or} \quad b = 0,$


that is, according as the indicatrix is an ellipse, two conjugate hyperbolas,
or two parallel lines. (In the synclastic case the ambiguous sign in the equation
19.81 must agree with the sign of $b_{11}$; otherwise the plane z = e would
fail to meet the surface. In the anticlastic case we need both signs: one for
each of the two conjugate hyperbolas.)

In the plane z = 0, which is the tangent plane at the origin, the vector
$(\cos\theta, \sin\theta, 0)$, making an angle $\theta$ with the x-axis (1, 0, 0), may be
identified with the tangent

$\overrightarrow{PT} = \mathbf{t} = \sum a^i \mathbf{r}_i = (a^1, a^2, 0)$


of Figure 19.2a. Thus the contravariant components of t are $a^1 = \cos\theta$,
$a^2 = \sin\theta$, and, by 19.35, the normal curvature in this direction is


$\kappa = \sum\sum b_{ij}\, a^i a^j$
19.82    $= b_{11} \cos^2\theta + 2 b_{12} \cos\theta \sin\theta + b_{22} \sin^2\theta.$
Expressing the indicatrix 19.81 in polar coordinates, we obtain
$b_{11} r^2 \cos^2\theta + 2 b_{12} r^2 \cos\theta \sin\theta + b_{22} r^2 \sin^2\theta = \pm 1,$


that is, $\kappa r^2 = \pm 1$, or
$r = |\kappa|^{-1/2}.$


In other words [La Vallée Poussin 2, p. 427],


The radius of the indicatrix in any direction is equal to the square root of
the radius of normal curvature in this direction.


Another way of expressing the same idea is to remark that the surface is
approximated by the paraboloid or parabolic cylinder


$2z = b_{11} x^2 + 2 b_{12} xy + b_{22} y^2.$


In any direction (at the origin) the given surface and the quadric have the
same normal curvature.


Figure 19.8a


If we choose the x- and y-axes along the principal directions at the origin,
as in Figure 19.8a, so that $b_{12} = 0$ and, by 19.58, $\kappa_{(i)} = b_{ii}$, the indicatrix
19.81 is simply

$\kappa_{(1)} x^2 + \kappa_{(2)} y^2 = \pm 1,$
and 19.82 yields Euler’s formula
$\kappa = \kappa_{(1)} \cos^2\theta + \kappa_{(2)} \sin^2\theta$
for the normal curvature in a direction making an angle $\theta$ with the first
principal direction.
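Euler’s formula and the radius $r = |\kappa|^{-1/2}$ of the indicatrix are easy to tabulate. The sketch below is an illustration; the function names `normal_curvature` and `indicatrix_radius` are assumptions made here.

```python
import math


def normal_curvature(k1, k2, theta):
    """Euler's formula for the direction at angle theta to the first principal direction."""
    return k1 * math.cos(theta) ** 2 + k2 * math.sin(theta) ** 2


def indicatrix_radius(k1, k2, theta):
    """Radius |kappa|^(-1/2) of Dupin's indicatrix; infinite where kappa = 0."""
    kappa = normal_curvature(k1, k2, theta)
    return math.inf if kappa == 0 else abs(kappa) ** -0.5


# Synclastic example with principal curvatures 4 and 1: the indicatrix is an
# ellipse with semiaxes 1/2 and 1 along the principal directions.
semi_axis_first = indicatrix_radius(4.0, 1.0, 0.0)
semi_axis_second = indicatrix_radius(4.0, 1.0, math.pi / 2)
```

A quick consequence visible from the formula: the normal curvatures in any two perpendicular directions add up to $\kappa_{(1)} + \kappa_{(2)} = 2H$.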
EXERCISES 


1. For which directions on a surface is the normal curvature equal to the arith- 
metic mean of the two principal curvatures? 


2. For the surface z = F(x, y),


$2H = \dfrac{g_{22} b_{11} - 2 g_{12} b_{12} + g_{11} b_{22}}{g} = \dfrac{(1 + z_2^2) z_{11} - 2 z_1 z_2 z_{12} + (1 + z_1^2) z_{22}}{(1 + z_1^2 + z_2^2)^{3/2}}.$


3. The surface xyz = 1 has umbilics at the four vertices of a regular tetrahedron
[Salmon 2, p. 300].


4. The surface z = x(x? — 3y?) has a parabolic umbilic at the origin. Sketch the 
section by a plane z = e, where e is a small number, positive or negative. 

This surface is called the monkey saddle because it would be the right kind of saddle 
for a monkey riding a bicycle: one way down for each hind leg and a third for the tail. 
Hilbert and Cohn-Vossen [1, p. 202, Fig. 213] made a nice drawing of the surface but 
a very misleading one of its generalized indicatrix [ibid., p. 192, Fig. 200]. For the 
true shape, see the second edition of Struik [1, p. 85].


Geodesics


Imagine a two-dimensional creature, sufficiently intelligent to make pre- 
cise measurements on the surface he inhabited, but unable to conceive of a 
third dimension. His world might be a plane or a parabolic cylinder; he
could not tell the difference. If it were a circular cylinder, local measurements
would still give the same results, though an expedition all the way
round would reveal a topological peculiarity. In all these cases his conclu- 
sion would be that his surface had zero curvature: K = 0. If, on the other 
hand, the surface were a sphere, he could detect its positive curvature by 
measurements within easy reach of his home, even if the radius of the sphere 
were relatively large. This information is a consequence of Gauss’s formula 
20.16, which expresses K in terms of the fundamental magnitudes of the 
first order. In §20.3 we shall see how Gauss’s complicated expression can 
be replaced by his simple one involving the three angles of a triangle 
(like our formula 6.92 for the area of a spherical triangle). In § 20.4 we shall 
extend these local measurements to global measurements, which would en- 
able our intelligent ant to determine the topological nature of his world: an 
idea which we shall investigate more systematically in Chapter 21. 

The remaining sections of the present chapter deal with the differential- 
geometric approach to the non-Euclidean planes, which may be regarded 
as surfaces of constant curvature. 


20.1 THEOREMA EGREGIUM 


The Christoffel symbols are called after Erwin Bruno Christoffel (1829 -
1901), who introduced [them] in 1869, . . . denoting our $\Gamma^i_{jk}$ by $\left\{ {jk \atop i} \right\}$.


The change to our present notation has been made under the influence 
of tensor theory. 


D. J. Struik (1894 - ) 
[Struik 1, p. 108] 


For an adequate discussion of “geodesics,” which are the most important
curves on a surface, we need one more notational device: the Christoffel
symbols


20.11    $\Gamma_{ij,k} = \mathbf{r}_{ij} \cdot \mathbf{r}_k, \quad \Gamma^k_{ij} = \mathbf{r}_{ij} \cdot \mathbf{r}^k.$

In virtue of 19.15, these symbols are related as follows:
20.12    $\Gamma_{ij,k} = \sum g_{kl}\, \Gamma^l_{ij}, \quad \Gamma^k_{ij} = \sum g^{kl}\, \Gamma_{ij,l}.$
Clearly, $\Gamma_{ji,k} = \Gamma_{ij,k}$ and $\Gamma^k_{ji} = \Gamma^k_{ij}$.

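Since the derivatives of the fundamental magnitudes $g_{ij}$ are
$(g_{ij})_k = \dfrac{\partial}{\partial u^k} (\mathbf{r}_i \cdot \mathbf{r}_j) = \mathbf{r}_{ik} \cdot \mathbf{r}_j + \mathbf{r}_i \cdot \mathbf{r}_{jk}$


20.13    $= \Gamma_{ik,j} + \Gamma_{jk,i},$


we can compute the Christoffel symbols of the first kind by means of the
formula


20.14    $\Gamma_{ij,k} = \frac{1}{2} \{ (g_{jk})_i + (g_{ik})_j - (g_{ij})_k \}$


and then deduce the Christoffel symbols of the second kind by means of the
latter half of 20.12.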
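The two-step recipe (20.14, then the latter half of 20.12) can be sketched numerically. The code below is an illustration under stated assumptions: the names `christoffel` and `sphere_metric` are hypothetical, and the derivatives $(g_{ij})_k$ are approximated by central differences rather than computed exactly.

```python
import math


def christoffel(metric, u, h=1e-5):
    """Christoffel symbols at u = (u1, u2), from a function returning g_ij.

    Returns (first, second) with first[i][j][k] = Gamma_{ij,k} and
    second[i][j][k] = Gamma^k_{ij}; indices run over 0, 1 instead of 1, 2.
    """
    def d_metric(k):
        # Central-difference approximation of (g_ij)_k.
        up, dn = list(u), list(u)
        up[k] += h
        dn[k] -= h
        gp, gm = metric(*up), metric(*dn)
        return [[(gp[i][j] - gm[i][j]) / (2 * h) for j in range(2)]
                for i in range(2)]

    dg = [d_metric(k) for k in range(2)]      # dg[k][i][j] = (g_ij)_k
    g = metric(*u)
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    ginv = [[g[1][1] / det, -g[0][1] / det], [-g[1][0] / det, g[0][0] / det]]
    rng = range(2)
    # 20.14: Gamma_{ij,k} = ((g_jk)_i + (g_ik)_j - (g_ij)_k) / 2
    first = [[[0.5 * (dg[i][j][k] + dg[j][i][k] - dg[k][i][j]) for k in rng]
              for j in rng] for i in rng]
    # 20.12: Gamma^k_{ij} = sum_l g^{kl} Gamma_{ij,l}
    second = [[[sum(ginv[k][l] * first[i][j][l] for l in rng) for k in rng]
               for j in rng] for i in rng]
    return first, second


def sphere_metric(u1, u2, radius=2.0):
    """Sphere with u1 = latitude, u2 = longitude: g = diag(R^2, R^2 cos^2 u1)."""
    return [[radius ** 2, 0.0], [0.0, (radius * math.cos(u1)) ** 2]]


first_kind, second_kind = christoffel(sphere_metric, (0.3, 1.0))
```

For this metric the only nonvanishing symbols of the second kind are $\Gamma^1_{22} = \sin u^1 \cos u^1$ and $\Gamma^2_{12} = \Gamma^2_{21} = -\tan u^1$, which the finite-difference values should reproduce to within the discretization error.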
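Applying 18.21 with $\mathbf{u} = \mathbf{r}_{ij}$ and $\mathbf{r}_3 = \mathbf{r}^3 = \mathbf{n}$, we obtain


$\mathbf{r}_{ij} = \mathbf{r}_{ij} \cdot \mathbf{r}^1\, \mathbf{r}_1 + \mathbf{r}_{ij} \cdot \mathbf{r}^2\, \mathbf{r}_2 + \mathbf{r}_{ij} \cdot \mathbf{r}^3\, \mathbf{r}_3$


$= \Gamma^1_{ij}\, \mathbf{r}_1 + \Gamma^2_{ij}\, \mathbf{r}_2 + b_{ij}\, \mathbf{n}$


20.15    $= \sum \Gamma^k_{ij}\, \mathbf{r}_k + b_{ij}\, \mathbf{n}.$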

These expressions for the second derivatives of r are known as “the equa- 
tions of Gauss.” 


Another of Gauss’s discoveries, which pleased him so much that he named 
it Theorema egregium, is an expression for K in terms of the g’s and their 
derivatives. This means that the Gaussian curvature can be computed by 
measurements made on the surface itself, without reference to the three- 
dimensional space in which it may lie. In other words, K is a “bending in- 
variant,” unchanged by the kind of distortion that takes place when a flat 
sheet of paper is rolled up to make a cylinder or a cone. The expression 
appears in the literature in various forms, not obviously identical. One of 
the neatest, discovered by Liouville,* is 


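20.16    $K = \dfrac{1}{\sqrt{g}} \left\{ \dfrac{\partial}{\partial u^2} \left( \dfrac{\sqrt{g}}{g_{11}}\, \Gamma^2_{11} \right) - \dfrac{\partial}{\partial u^1} \left( \dfrac{\sqrt{g}}{g_{11}}\, \Gamma^2_{12} \right) \right\}.$
This can be derived by applying 19.49 to the unit tangent vector $\mathbf{t} = \mathbf{r}_1/g_1$.
Since $[\mathbf{n}\, \mathbf{r}_1\, \mathbf{r}_1] = 0$ and (by 19.33) $\mathbf{n} \times \mathbf{r}_1 = \sqrt{g}\, \mathbf{r}^2$, we have

$[\mathbf{n}\, \mathbf{t}\, \mathbf{t}_i] = \left[ \mathbf{n}\ \dfrac{\mathbf{r}_1}{g_1}\ \dfrac{\mathbf{r}_{1i}}{g_1} \right] = \dfrac{(\mathbf{n} \times \mathbf{r}_1) \cdot \mathbf{r}_{1i}}{g_{11}}$
$= \dfrac{\sqrt{g}\, \mathbf{r}^2 \cdot \mathbf{r}_{1i}}{g_{11}} = \dfrac{\sqrt{g}}{g_{11}}\, \Gamma^2_{1i}.$

Hence
$\sqrt{g}\, K = \sum\sum \epsilon^{ij} \left( \dfrac{\sqrt{g}}{g_{11}}\, \Gamma^2_{1i} \right)_j = \left( \dfrac{\sqrt{g}}{g_{11}}\, \Gamma^2_{11} \right)_2 - \left( \dfrac{\sqrt{g}}{g_{11}}\, \Gamma^2_{12} \right)_1.$


* J. Liouville, Sur la théorie générale des surfaces, Journal de Mathématiques 16 (1851), pp.
130-132.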

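Theorema egregium takes a more symmetrical form when the parametric
curves are orthogonal, so that


$g_{12} = 0, \quad g = g_{11} g_{22} = (g_1 g_2)^2,$
$\dfrac{\sqrt{g}}{g_{11}}\, \Gamma^2_{12} = \dfrac{g_1 g_2}{g_{11}}\, \dfrac{(g_{22})_1}{2 g_{22}} = \dfrac{g_2}{g_1}\, \dfrac{2 g_2 (g_2)_1}{2 g_2^2} = \dfrac{(g_2)_1}{g_1}$
and
$\dfrac{\sqrt{g}}{g_{11}}\, \Gamma^2_{11} = -\dfrac{g_1 g_2}{g_{11}}\, \dfrac{(g_{11})_2}{2 g_{22}} = -\dfrac{g_2}{g_1}\, \dfrac{2 g_1 (g_1)_2}{2 g_2^2} = -\dfrac{(g_1)_2}{g_2}.$
In fact,
$\sqrt{g}\, K = \left( \dfrac{\sqrt{g}}{g_{11}}\, \Gamma^2_{11} \right)_2 - \left( \dfrac{\sqrt{g}}{g_{11}}\, \Gamma^2_{12} \right)_1$
20.17    $= -\left\{ \left( \dfrac{(g_2)_1}{g_1} \right)_1 + \left( \dfrac{(g_1)_2}{g_2} \right)_2 \right\}$


[Weatherburn 2, p. 98; Struik 1, p. 113].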
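Formula 20.17 uses only the lengths $g_1 = \sqrt{g_{11}}$ and $g_2 = \sqrt{g_{22}}$, so it can be evaluated from measurements made on the surface itself. The sketch below is an illustration; the name `gaussian_curvature_orthogonal` is hypothetical, and the derivatives are replaced by nested central differences, so the result is only approximate.

```python
import math


def gaussian_curvature_orthogonal(g1, g2, u1, u2, h=1e-4):
    """Approximate K from 20.17 when the parametric curves are orthogonal.

    g1(u1, u2) and g2(u1, u2) return sqrt(g_11) and sqrt(g_22).
    """
    def first_term(a, b):
        # (g_2)_1 / g_1 at the point (a, b)
        return (g2(a + h, b) - g2(a - h, b)) / (2 * h) / g1(a, b)

    def second_term(a, b):
        # (g_1)_2 / g_2 at the point (a, b)
        return (g1(a, b + h) - g1(a, b - h)) / (2 * h) / g2(a, b)

    d1 = (first_term(u1 + h, u2) - first_term(u1 - h, u2)) / (2 * h)
    d2 = (second_term(u1, u2 + h) - second_term(u1, u2 - h)) / (2 * h)
    root_g = g1(u1, u2) * g2(u1, u2)       # sqrt(g) = g_1 g_2
    return -(d1 + d2) / root_g


# Sphere of radius 2 with u1 = latitude: g_1 = 2, g_2 = 2 cos(u1).
k_sphere = gaussian_curvature_orthogonal(lambda a, b: 2.0,
                                         lambda a, b: 2.0 * math.cos(a),
                                         0.3, 1.0)
# Plane in polar coordinates: g_1 = 1, g_2 = u1.
k_plane = gaussian_curvature_orthogonal(lambda a, b: 1.0,
                                        lambda a, b: a, 1.5, 0.4)
```

The sphere should give a value close to 1/4 and the plane a value close to 0, in keeping with the statement that K is a bending invariant determined by the g’s alone.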
EXERCISES 
1. Obtain a variant of 20.16 by taking $\mathbf{t} = \mathbf{r}_2/g_2$.
2. For the case when $g_{12} = 0$, express all the Christoffel symbols in terms of $g_{11}$,
$g_{22}$, and their derivatives.
3. Compute the Christoffel symbols for polar coordinates in the plane.
4. For a surface in Monge’s form z = F(x, y),


$\Gamma^k_{ij} = z_{ij} z_k / (1 + z_1^2 + z_2^2).$
5. $\sum \Gamma^i_{ij} = (\log \sqrt{g})_j.$
6.* Compute K for a surface on which
$g_{11} = \dfrac{(u^2)^2 + a^2}{\{(u^1)^2 + (u^2)^2 + a^2\}^2}, \quad g_{12} = \dfrac{-u^1 u^2}{\{(u^1)^2 + (u^2)^2 + a^2\}^2}, \quad g_{22} = \dfrac{(u^1)^2 + a^2}{\{(u^1)^2 + (u^2)^2 + a^2\}^2}.$


* E. Beltrami, Annali di Matematica (1) 7 (1866), pp. 197-198.


20.2 THE DIFFERENTIAL EQUATIONS FOR GEODESICS 


Every sufficiently small portion of a geodesic is the shortest path on 
the surface connecting the end-points of the portion... . All the in- 
trinsic properties of a surface (such as its Gaussian curvature) can be 
determined by drawing geodesics and measuring their arc lengths... . 
We can obtain an approximation to the geodesics by moving a very 
small buggy along the surface on two wheels, the wheels being rigidly 
fastened to their common axis so that their speeds of rotation are 
equal. ... The entire course of a geodesic is determined if one of its 
points and its direction at this point are given.... The straightest lines 
may also be characterized by the geometric requirement that the os- 
culating plane of the curve is to contain the normal to the surface at 
every point of the curve. 


Hilbert and Cohn-Vossen 
[1, pp. 220 -221] 


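Consider the possibility of a curve on a surface having all its principal
normals normal to the surface. As we saw in § 19.3, any curve on a surface
satisfies


$\kappa \mathbf{p} = \dot{\mathbf{t}} = \dfrac{d}{ds} \sum \dot{u}^i \mathbf{r}_i = \sum \ddot{u}^i \mathbf{r}_i + \sum\sum \dot{u}^i \dot{u}^j \mathbf{r}_{ij}.$
In the present case, since $\mathbf{p} = \mathbf{n}$ is perpendicular to $\mathbf{r}^k$, we have
$(\sum \ddot{u}^i \mathbf{r}_i + \sum\sum \dot{u}^i \dot{u}^j \mathbf{r}_{ij}) \cdot \mathbf{r}^k = 0.$
Since $\mathbf{r}_i \cdot \mathbf{r}^k = \delta_i^k$ and $\mathbf{r}_{ij} \cdot \mathbf{r}^k = \Gamma^k_{ij}$, these equations reduce to
20.21    $\ddot{u}^k + \sum\sum \Gamma^k_{ij}\, \dot{u}^i \dot{u}^j = 0 \quad (k = 1, 2),$
meaning


$\dfrac{d^2 u^k}{ds^2} + \sum\sum \Gamma^k_{ij}\, \dfrac{du^i}{ds}\, \dfrac{du^j}{ds} = 0$


[Struik 1, p. 132]. Theoretically, we could eliminate s from these two equations
so as to obtain a single differential equation; but it is usually more
convenient either to use both equations or to use one of them along with
19.16.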

The curves so determined are called geodesics [Weatherburn 2, p. 100]. 
Since the equations express the second derivatives of u* in terms of the first 
derivatives, there is a geodesic through any given point A (on the surface) 
in any given direction. There is also, in general, a unique geodesic joining 
two given points A and B. In these respects the geodesics on a surface re- 
semble the straight lines in a plane; in fact, as we shall see, they are straight 
when the surface is a plane. 
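Because 20.21 gives the second derivatives in terms of the first, a geodesic can be traced step by step from an initial point and direction. The sketch below is an illustration, not the text’s method: it integrates 20.21 on the unit sphere with the classical fourth-order Runge-Kutta scheme (a numerical technique chosen here), using latitude and longitude as $u^1$ and $u^2$; the names `sphere_geodesic` and `to_xyz` are hypothetical.

```python
import math


def sphere_geodesic(lat0, lon0, dlat, dlon, length, steps=2000):
    """Integrate 20.21 on the unit sphere; returns a list of (u1, u2) points.

    With u1 = latitude and u2 = longitude, g_11 = 1, g_22 = cos^2(u1), and the
    nonvanishing symbols are Gamma^1_22 = sin(u1) cos(u1) and
    Gamma^2_12 = Gamma^2_21 = -tan(u1).
    """
    def deriv(state):
        u1, u2, v1, v2 = state
        acc1 = -math.sin(u1) * math.cos(u1) * v2 * v2   # -Gamma^1_22 (v2)^2
        acc2 = 2.0 * math.tan(u1) * v1 * v2             # -2 Gamma^2_12 v1 v2
        return (v1, v2, acc1, acc2)

    state = (lat0, lon0, dlat, dlon)
    ds = length / steps
    path = [state[:2]]
    for _ in range(steps):
        k1 = deriv(state)
        k2 = deriv(tuple(s + 0.5 * ds * k for s, k in zip(state, k1)))
        k3 = deriv(tuple(s + 0.5 * ds * k for s, k in zip(state, k2)))
        k4 = deriv(tuple(s + ds * k for s, k in zip(state, k3)))
        state = tuple(s + ds * (a + 2 * b + 2 * c + d) / 6
                      for s, a, b, c, d in zip(state, k1, k2, k3, k4))
        path.append(state[:2])
    return path


def to_xyz(lat, lon):
    """Cartesian point of the unit sphere with the given latitude and longitude."""
    return (math.cos(lat) * math.cos(lon),
            math.cos(lat) * math.sin(lon),
            math.sin(lat))


# Start on the equator at unit speed, heading 45 degrees north of east.
c = math.cos(math.pi / 4)
path = sphere_geodesic(0.0, 0.0, c, c, 2.0)
# A great circle through (1, 0, 0) with this heading lies in the plane
# perpendicular to (0, -c, c); measure how far the computed path strays.
deviation = max(abs(-c * y + c * z) for _, y, z in (to_xyz(*p) for p in path))
```

The computed path should stay in a plane through the centre, which is the great-circle behaviour expected of geodesics on a sphere.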

When $g_{12} = 0$, so that the parametric curves are orthogonal, the differential
equations take the form
20.22    $g_{11} \ddot{u}^1 + \frac{1}{2} (g_{11})_1 (\dot{u}^1)^2 + (g_{11})_2\, \dot{u}^1 \dot{u}^2 - \frac{1}{2} (g_{22})_1 (\dot{u}^2)^2 = 0,$
20.23    $g_{22} \ddot{u}^2 - \frac{1}{2} (g_{11})_2 (\dot{u}^1)^2 + (g_{22})_1\, \dot{u}^1 \dot{u}^2 + \frac{1}{2} (g_{22})_2 (\dot{u}^2)^2 = 0.$


The latter shows that the parametric curves $\dot{u}^2 = 0$ occur among the geodesics
if and only if $g_{11}$ is a function of $u^1$ alone (not involving $u^2$), so that
$(g_{11})_2 = 0$. In this case the curves $\dot{u}^2 = 0$ are a one-parameter system of
geodesics and the curves $\dot{u}^1 = 0$ are their orthogonal trajectories. Since
$g_{11}$ is a function of $u^1$ alone, the differential form


$ds^2 = g_{11} (du^1)^2 + g_{22} (du^2)^2$


can be simplified by changing the notation so that $\int g_1\, du^1$ is called $u^1$. Then
$g_{11} = 1$, $g = g_{22}$, and


20.24    $ds^2 = (du^1)^2 + g\, (du^2)^2.$


The effect of this change of notation is to make $u^1$ measure the arc of each
geodesic $u^2$ = constant. The differential equations are now


20.25    $\ddot{u}^1 - \frac{1}{2} (g)_1 (\dot{u}^2)^2 = 0,$
20.26    $\dfrac{d}{ds} (g \dot{u}^2) - \frac{1}{2} (g)_2 (\dot{u}^2)^2 = 0.$


In particular, we obtain geodesic polar coordinates (analogous to ordinary
polar coordinates in the plane) by measuring $u^1$ from A along all the geodesics
through A, and defining $u^2$ to be the angle that such a geodesic $u^2$ =
constant makes with an “initial” geodesic $u^2 = 0$.

The length of any curve from A to a point B (of general position) is obtained
by integrating ds along the curve. The equation 20.24 shows that
$\int ds \ge \int du^1$, with equality only when $du^2 = 0$. Hence the geodesic AB is
the shortest path from A to B. In fact, it is the curve along which a tightly
stretched string would lie on the smooth convex side of a material surface.
Since the only forces acting on an “element” of the string are the tensions 
at its two ends and the reaction of the surface along the normal n, these 
three forces must be in equilibrium. Hence n must lie in the plane of the 
two tensions, which is the osculating plane of the curve. These considera- 
tions provide a statical explanation for the equation p = n which started 
this investigation. 

The curves $u^1$ = constant (which, of course, are not geodesics) are called
“geodesic circles.” The circumference of such a “circle” is obtained by
integrating ds (given by 20.24 with $du^1 = 0$); thus it is


$\int_0^{2\pi} \sqrt{g}\;du^2.$


When the radius $u^1$ is small, this circumference is approximated by both
$2\pi\sqrt{g}$ and $2\pi u^1$. Hence the first term in the Maclaurin series for
$\sqrt{g}$ is simply $u^1$, that is,

20.27   If $u^1 = 0$, then $(\sqrt{g})_1 = 1$.


Since geodesics are curves of shortest length, the geodesics on a sphere
are the great circles, and the geodesics in a plane are the straight lines. We
can adapt Figure 8.55 to geodesic polar coordinates by writing $u^1$ and $u^2$
for r and $\theta$, so that 20.24 expresses Pythagoras’s theorem for the
infinitesimal triangle PP’N, and the angle $\phi$ between the geodesics OP’ and
PP’ is given by


$\cos\phi = \lim \frac{du^1}{ds} = \dot u^1$   or   $\sin\phi = \lim \frac{\sqrt{g}\,du^2}{ds} = \sqrt{g}\,\dot u^2.$


Differentiating $\cos\phi$ and using 20.25, we obtain


$-\sin\phi\;\dot\phi = \ddot u^1 = \tfrac12 (g)_1 (\dot u^2)^2 = (\sqrt{g})_1 \sqrt{g}\,(\dot u^2)^2 = (\sqrt{g})_1 \sin\phi\;\dot u^2.$


Thus

20.28   $d\phi = -(\sqrt{g})_1\,du^2.$
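

As a numerical illustration (a sketch only, not part of the original argument), the equations 20.25 and 20.26 can be integrated step by step. The example assumes the unit sphere, where $g = \sin^2 u^1$, writes r and theta for $u^1$ and $u^2$, and uses a classical fourth-order Runge-Kutta step; all function names are hypothetical. It measures how far the computed curve strays from the plane through the center determined by its starting point and starting direction, which should be negligible if the curve is a great circle.

```python
import math

def geodesic_rhs(state):
    """Equations 20.25 and 20.26 with g = sin^2(r), solved for r'' and theta''."""
    r, theta, dr, dtheta = state
    # 20.25: r'' = (1/2) g_1 theta'^2, where g_1 = 2 sin r cos r.
    ddr = math.sin(r) * math.cos(r) * dtheta ** 2
    # 20.26 with g_2 = 0: d/ds(g theta') = 0, so theta'' = -(g_1 / g) r' theta'.
    ddtheta = -2.0 * math.cos(r) / math.sin(r) * dr * dtheta
    return (dr, dtheta, ddr, ddtheta)

def rk4_step(state, h):
    """One classical Runge-Kutta step of arc length h."""
    def shifted(k, f):
        return tuple(s + f * ki for s, ki in zip(state, k))
    k1 = geodesic_rhs(state)
    k2 = geodesic_rhs(shifted(k1, h / 2))
    k3 = geodesic_rhs(shifted(k2, h / 2))
    k4 = geodesic_rhs(shifted(k3, h))
    return tuple(s + h / 6 * (a + 2 * b + 2 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def integrate_geodesic(r0, theta0, phi0, length, steps=2000):
    """Unit-speed geodesic from (r0, theta0), making angle phi0 with the radial geodesic."""
    # cos(phi) = r' and sin(phi) = sqrt(g) theta', as in the text before 20.28.
    state = (r0, theta0, math.cos(phi0), math.sin(phi0) / math.sin(r0))
    path = [state]
    for _ in range(steps):
        state = rk4_step(state, length / steps)
        path.append(state)
    return path

def to_cartesian(r, theta):
    """Point of the unit sphere with colatitude r and longitude theta."""
    return (math.sin(r) * math.cos(theta), math.sin(r) * math.sin(theta), math.cos(r))

def max_plane_deviation(path):
    """Largest distance of the path from the plane through the origin spanned by
    the initial point and the initial unit tangent vector."""
    r, th, dr, dth = path[0]
    p = to_cartesian(r, th)
    v = (dr * math.cos(r) * math.cos(th) - dth * math.sin(r) * math.sin(th),
         dr * math.cos(r) * math.sin(th) + dth * math.sin(r) * math.cos(th),
         -dr * math.sin(r))
    n = (p[1] * v[2] - p[2] * v[1], p[2] * v[0] - p[0] * v[2], p[0] * v[1] - p[1] * v[0])
    return max(abs(sum(a * b for a, b in zip(to_cartesian(s[0], s[1]), n))) for s in path)
```

The starting point must avoid the pole $u^1 = 0$, where the coordinates are singular. Along the computed path the quantity $g\,\dot u^2$ should also remain constant, in agreement with 20.26.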
EXERCISES 


1. In the case of the general surface of revolution

$\mathbf{r} = (u^1 \cos u^2,\; u^1 \sin u^2,\; z),$


where z is a function of $u^1$ alone, the differential equation 20.23 for geodesics reduces to


$\frac{d}{ds}\left[(u^1)^2\,\dot u^2\right] = 0.$


One solution is $du^2 = 0$, showing that the meridians are geodesics. In all other cases
the constant value of $(u^1)^2\,\dot u^2$ may be denoted by 1/h, so that


$ds = h\,(u^1)^2\,du^2.$

Comparing this with 19.16, obtain the complete integral


$u^2 = \int \frac{\sqrt{1 + z_1^2}\;du^1}{u^1 \sqrt{h^2 (u^1)^2 - 1}}$

[Weatherburn 2, p. 102].


2. The geodesics on a cylinder are helices. 


3. The geodesics on the cone

$\mathbf{r} = (u^1 \cos u^2,\; u^1 \sin u^2,\; u^1 \cot\alpha)$

are given by

$a u^1 = \sec(\beta + u^2 \sin\alpha),$


where a and $\beta$ are constants. Are these curves concho-spirals?


20.3 THE INTEGRAL CURVATURE OF A GEODESIC TRIANGLE


The integral curvature of a region on the surface is equal to the area 
of its spherical image,... This property of the integral curvature 
was already known to the French school of Monge before Gauss 
pointed out its significance for the intrinsic geometry of a surface. 


D. J. Struik [1, p. 156] 


Formulas 6.92 and 16.53, for the areas of spherical and hyperbolic tri- 
angles, are special cases of a beautiful formula which Gauss discovered for 
the integral curvature of a triangle formed by arcs of three geodesics on any 
smooth surface. We proceed to establish this formula 


20.31   $\iint_{ABC} K\,dS = A + B + C - \pi.$


Setting $g_1 = 1$ and $g_2 = \sqrt{g}$ in 20.17, we see that, when geodesic polar
coordinates are used, the formula for K is simply


20.32   $\sqrt{g}\,K = -(\sqrt{g})_{11}.$


Figure 20.3a 


Consider a geodesic triangle ABC with its side AB along the initial
geodesic $u^2 = 0$, as in Figure 20.3a. Integrating K over the area of this
triangle with the help of 19.24, we obtain


$\iint K\,dS = \iint K \sqrt{g}\;du^1\,du^2 = -\iint (\sqrt{g})_{11}\;du^1\,du^2 = -\int \Big[(\sqrt{g})_1\Big]_{u^1 = 0}^{BC}\,du^2.$


By 20.27, $(\sqrt{g})_1 = 1$ when $u^1 = 0$; and by 20.28, $-(\sqrt{g})_1\,du^2 = d\phi$ for any
point on the geodesic BC. Hence


$\iint K\,dS = \int_0^A \big[1 - (\sqrt{g})_1\big]\,du^2 = \int_0^A du^2 + \int_{\pi - B}^{C} d\phi$
$= A + C - (\pi - B) = A + B + C - \pi$


[Weatherburn 2, p. 117]. 
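

A small numerical check may make 20.31 more concrete. The sketch below assumes the unit sphere (K = 1, $\sqrt{g} = \sin u^1$) and a triangle with one vertex at the pole and the opposite side along the equator, so that its angles are $\lambda$, $\pi/2$, $\pi/2$; the function names and the midpoint rule are illustrative choices, not taken from the text.

```python
import math

def integral_curvature_polar_triangle(lam, n=2000):
    """Integrate K * sqrt(g) over 0 <= u1 <= pi/2, 0 <= u2 <= lam on the unit sphere,
    where K = 1 and sqrt(g) = sin(u1)."""
    h = (math.pi / 2) / n
    # Midpoint rule in u1; the integrand does not depend on u2.
    radial = sum(math.sin((i + 0.5) * h) for i in range(n)) * h
    return radial * lam

def angular_excess(A, B, C):
    """Right-hand side of 20.31."""
    return A + B + C - math.pi
```

For any $\lambda$ between 0 and $2\pi$ the two functions should agree to within the error of the quadrature.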
EXERCISES
1. Obtain 20.32 directly from 19.49 with $\mathbf{t} = \mathbf{r}_1$.


2. On the unit sphere, colatitude and longitude serve as geodesic polar coordinates
with $g = \sin^2 u^1$. What happens to 20.32 in this case? For convenience, write r, $\theta$
for $u^1$, $u^2$. The differential equation 20.26 (with $(g)_2 = 0$) has, as a first integral,


$\sin^2 r\;\dot\theta = 1/h$

(an arbitrary constant). Combining this formula for $\dot\theta = d\theta/ds$ with

$ds^2 = dr^2 + \sin^2 r\,d\theta^2,$

deduce $dr = \sin r\,\sqrt{h^2 \sin^2 r - 1}\;d\theta$,

$\theta = \int \frac{\csc^2 r\,dr}{\sqrt{h^2 - \csc^2 r}} = \alpha + \arccos(k \cot r),$

where $k = 1/\sqrt{h^2 - 1}$, and


$k \cot r = \cos(\theta - \alpha).$

Expressing this solution in terms of the Cartesian coordinates

$x = \sin r \cos\theta,\quad y = \sin r \sin\theta,\quad z = \cos r,$


verify that the geodesics on a sphere are the great circles (lying in planes through the 
origin). 


20.4 THE EULER-POINCARE CHARACTERISTIC 


As we saw on page 281, any polyhedron inscribed in a sphere can be pro- 
jected from the center onto the surface of the sphere so as to form a map. 
In fact, the V vertices are joined in pairs by E geodesic arcs (which we still 
call edges), decomposing the spherical surface into F polygonal regions 
(which we still call faces). More generally, a map may be obtained by draw- 
ing a sufficient number of geodesic arcs on any closed surface. We can 
insist that the points (“vertices”) be so placed and so joined that every face 
is simply connected, that is, that the boundary of the face can be continu- 
ously shrunk to a single point without leaving the surface. 

In § 10.3 we used a Schlegel diagram to prove Euler’s formula. We could 
just as well have used the corresponding map on a sphere. The same argu- 
ment, applied to a map on the general surface, shows that the Euler-Poin- 
caré characteristic 


$\chi = V - E + F$


is essentially a property of the surface, that is, that it has the same value 
for all maps drawn on the given surface. It is a remarkable fact that this 
property of the surface can be expressed very simply in terms of the integral 
curvature (which is obtained by integrating K over the whole surface).

Consider first a sphere, on which a map (with V = E = 3 and F = 2) is
obtained by taking, as vertices, three points on a great circle. Each
hemisphere is bounded by three arcs of the great circle, forming a spherical
“triangle” whose three angles are $\pi$, $\pi$, $\pi$. By 20.31, each hemisphere has
integral curvature


$\pi + \pi + \pi - \pi = 2\pi.$


Hence the integral curvature of the whole sphere is $4\pi$ (as it obviously must
be, since the “spherical image” of any sphere is a unit sphere, whose surface
area is $4\pi$). The general formula, of which this is a very special case, is


20.41   $\iint K\,dS = 2\pi\chi,$


where the integration is taken over the whole of any given surface of
characteristic $\chi$.


To establish 20.41, we consider a map formed by E geodesic arcs on the
given surface, choosing the V vertices in such positions that no face has a
re-entrant angle (i.e., an angle greater than $\pi$). The map can then be
“triangulated” by selecting a new vertex inside each face and joining it by new
geodesic arcs to all the vertices of that face. This procedure yields a new
map having V + F vertices and 2E triangular faces. Since the sum of all
the angles of all these 2E triangles amounts to $2\pi$ for each of the V + F
vertices, the integral curvature of the whole surface is


$\iint K\,dS = \sum (A + B + C - \pi)$
$= 2\pi(V + F) - 2E\pi = 2\pi(V + F - E)$
$= 2\pi\chi.$


It follows that the integral curvature of a closed surface is not altered by 
topological transformation. For instance, the value $4\pi$ is maintained when
a sphere is deformed into an ellipsoid or any other oval surface. The de- 
formation may even be continued so as to bring in anticlastic regions. 
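

The counting in the proof of 20.41 can be followed mechanically. The sketch below (hypothetical function names, plain arithmetic only) forms the numbers of vertices, edges, and faces of the triangulated map and then adds up the angular excesses of its triangles, using $2\pi$ of angle at each vertex.

```python
import math

def triangulated_counts(V, E, F):
    """Counts after placing one new vertex inside each face and joining it
    to all the vertices of that face."""
    # A face with n sides receives n new edges and splits into n triangles;
    # summed over all the faces, n adds up to 2E.
    return V + F, E + 2 * E, 2 * E

def integral_curvature(V, E, F):
    """Sum of (A + B + C - pi) over the 2E triangles of the triangulated map."""
    new_V, _, new_F = triangulated_counts(V, E, F)
    return 2 * math.pi * new_V - math.pi * new_F
```

For the map of a cube on a sphere (V = 8, E = 12, F = 6) this gives $4\pi$, and the triangulated map has the same characteristic as the original one.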


EXERCISES 


1. The torus 8.88 (where a > b) is constructed by revolving a circle of radius b about
a line (in its plane) distant a from the center. On this surface we can draw two circles,
of radii b and a + b, which are geodesics having just one common point. These form
a map in which V = F = 1, E = 2. Hence the integral curvature is zero. (The positive
integral curvature of the “outer” synclastic part of the torus is exactly balanced
by the negative integral curvature of the “inner” anticlastic part.)

2. Describe two further geodesics on the torus so that the four geodesics make a
map in which V = F = 4, E = 8.


20.5 SURFACES OF CONSTANT CURVATURE


When Gauss was nineteen his mother asked his mathematical friend 
Wolfgang Bolyai whether Gauss would ever amount to anything. 
When Bolyai exclaimed, “The greatest mathematician in Europe!” she 
burst into tears. 


E. T. Bell [1, p. 252] 


When we study a surface by means of the fundamental magnitudes of 
the first order and the consequent Christoffel symbols, we are treating it 
“intrinsically,” exploring it like the hypothetical two-dimensional creature 
who could not imagine any direction outside the surface itself. Such a crea- 
ture could measure distances by means of the formula 19.16, distinguish 
geodesics as the shortest paths from place to place, and measure the Gaus- 
sian curvature K at any point. 


One of the most elegant approaches to non-Euclidean geometry is to re- 
gard the elliptic or hyperbolic plane as a nondevelopable surface which is 
homogeneous (all positions alike) and isotropic (all directions alike). Since 
the surface is homogeneous, its Gaussian curvature is constant. By using 
a suitable unit of distance, we may take the constant value of K to be 1 or
-1 according as it is positive or negative. We shall find it convenient to
use geodesic polar coordinates. Since the surface is homogeneous and
isotropic, the expression 20.24 will be the same wherever we place the pole
$u^1 = 0$ and the initial geodesic $u^2 = 0$, and g will be a function of $u^1$ alone,
independent of $u^2$. The “straight lines” of the geometry are the geodesics
on the surface, and it is not necessary to regard the surface as being embedded 
in a 3-space. 

Setting $K = \pm 1$ in 20.32, we obtain the differential equation


$(\sqrt{g})_{11} = \mp\sqrt{g},$

which yields


$\sqrt{g} = A \sin u^1 + B \cos u^1$   or   $A \sinh u^1 + B \cosh u^1.$


At the pole, ds (= $du^1$) is independent of $u^2$; therefore g = 0 when $u^1 = 0$;
that is, B = 0. Also, by 20.27, A = 1. Hence


$\sqrt{g} = \sin u^1$   or   $\sinh u^1$

and

$ds^2 = (du^1)^2 + \sin^2 u^1\,(du^2)^2$   or   $(du^1)^2 + \sinh^2 u^1\,(du^2)^2.$
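

The same conclusion can be reached numerically. The sketch below treats 20.32 with constant K as the ordinary differential equation $y'' = -Ky$ for $y = \sqrt{g}$, imposes the conditions $y(0) = 0$ and $y'(0) = 1$ (from g = 0 at the pole and 20.27), and integrates with a classical Runge-Kutta scheme. The function name and step count are assumptions made for illustration.

```python
import math

def solve_sqrt_g(K, u_end, steps=1000):
    """Integrate y'' = -K*y with y(0) = 0, y'(0) = 1 up to u1 = u_end.
    Here y stands for sqrt(g) in geodesic polar coordinates."""
    def rhs(y, v):
        return v, -K * y
    h = u_end / steps
    y, v = 0.0, 1.0
    for _ in range(steps):
        k1y, k1v = rhs(y, v)
        k2y, k2v = rhs(y + h / 2 * k1y, v + h / 2 * k1v)
        k3y, k3v = rhs(y + h / 2 * k2y, v + h / 2 * k2v)
        k4y, k4v = rhs(y + h * k3y, v + h * k3v)
        y += h / 6 * (k1y + 2 * k2y + 2 * k3y + k4y)
        v += h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
    return y
```

With K = 1 the result should agree with $\sin u^1$, and with K = -1 it should agree with $\sinh u^1$, up to the discretization error.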
EXERCISE 


Compute the circumference of the geodesic circle $u^1 = r$ (i) in the elliptic plane,
(ii) in the hyperbolic plane.


20.6 THE ANGLE OF PARALLELISM


In the elliptic case, we can identify $u^1$ and $u^2$ with colatitude and
longitude on the unit sphere (as in Ex. 2 at the end of § 20.3). Accordingly, we
now restrict consideration to the hyperbolic case, in which


$g = \sinh^2 u^1.$

For convenience, let us write r, $\theta$ for $u^1$, $u^2$, so that*

$ds^2 = dr^2 + \sinh^2 r\,d\theta^2.$

To determine the straight lines of the hyperbolic plane, we use the
differential equation 20.26 with $g = \sinh^2 r$, namely,


$\frac{d}{ds}\left(\sinh^2 r\;\dot\theta\right) = 0.$

For a first integral we obtain $\sinh^2 r\;\dot\theta = h^{-1}$ (a constant), whence


$dr^2 + \sinh^2 r\,d\theta^2 = ds^2 = (h \sinh^2 r\,d\theta)^2,$

so that $dr^2 = \sinh^2 r\,(h^2 \sinh^2 r - 1)\,d\theta^2$ and


$\theta = \int \frac{dr}{\sinh r\,\sqrt{h^2 \sinh^2 r - 1}} = \int \frac{\operatorname{csch}^2 r\,dr}{\sqrt{h^2 - \operatorname{csch}^2 r}}$
$= -\int \frac{d\coth r}{\sqrt{h^2 + 1 - \coth^2 r}} = \alpha + \arccos(k \coth r),$

where $k = 1/\sqrt{h^2 + 1}$. Hence, finally, the lines are given by

$k \coth r = \cos(\theta - \alpha).$


When h tends to infinity, so that k tends to zero, we have the radial lines, for
which $\theta$ is constant. The line through (a, 0) perpendicular to the initial line
$\theta = 0$ (being a geodesic which is unchanged when $\theta$ is replaced by $-\theta$), is


$\tanh a\,\coth r = \cos\theta.$


We can use these results to find relations between the sides and angles of 
a right-angled triangle ABC with its right angle at C and its side BC along 
the initial line, as in Figure 20.6a. Since the equations for the lines BC, 
AB, and CA are 


$\theta = 0$,   $\theta = B$,   and   $\tanh a = \tanh r \cos\theta$,

the lengths of the sides BC = a and AB = c are related by the equation

$\tanh a = \tanh c\,\cos B$


[Carslaw 1, p. 109; Coxeter 3, pp. 238, 282]. (Another formula of the same
kind can be obtained by changing a and B into b and A.)


* E. Beltrami, Giornale di Matematiche, 6 (1868), p. 298 (12).


Figure 20.6a


The angle of parallelism $\Pi(a)$ is the value of B that makes c infinite; that is,


$\Pi(a) = B,$
where   $\cos B = \tanh a$,   $\sin B = \operatorname{sech} a$,
$\csc B = \cosh a$,   $\cot B = \sinh a$,
$\csc B - \cot B = \cosh a - \sinh a$,
$\tan \tfrac12 B = e^{-a}.$


We have thus established Lobachevsky’s formula

$\Pi(a) = 2 \arctan e^{-a}$


[Coxeter 3, p. 208]. This is a precise expression for the function that we 
studied tentatively in § 16.3. 
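

The chain of identities leading to Lobachevsky’s formula is easy to test in floating-point arithmetic. In the sketch below the function names are hypothetical; `angle_at_B` simply solves $\tanh a = \tanh c \cos B$ for B, so that letting c grow large should reproduce $\Pi(a)$.

```python
import math

def angle_of_parallelism(a):
    """Lobachevsky's formula: Pi(a) = 2 arctan(e^(-a))."""
    return 2.0 * math.atan(math.exp(-a))

def angle_at_B(a, c):
    """Angle B of a right-angled hyperbolic triangle with side a and hypotenuse c > a,
    obtained from tanh a = tanh c * cos B."""
    return math.acos(math.tanh(a) / math.tanh(c))
```

One may confirm, for several values of a, that the cosine of the returned angle is $\tanh a$ and its sine is $\operatorname{sech} a$.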


EXERCISE 


Compute $\Pi(a)$ for a few suitably chosen values of a, and sketch the curve $y = \Pi(x)$.
Where does $\Pi(u)$ occur in Figure 17.45?


20.7 THE PSEUDOSPHERE 


Having obtained the hyperbolic plane as a surface of constant negative 
curvature, it is natural for us to ask whether such a surface can be embedded 
in Euclidean space. In other words, can the hyperbolic plane, or a finite 
part of it, be represented isometrically by a surface in ordinary space, in 
some such manner as the elliptic plane is represented (twice over) by a 
sphere? The answer is No and Yes: there is no such representation of the 
whole hyperbolic plane* but there are certain surfaces that will serve for a 
portion of finite area [Klein 4, p. 286]. The simplest instance, which Liou- 
ville named the pseudosphere, is one half of the “tractroid” formed by re- 
volving the tractrix 17.51 about its asymptote. 

* G. Lütkemeyer, Ueber den analytischen Charakter der Integrale von partiellen
Differentialgleichungen (Göttingen, 1902).

Writing z for x, x for y, and setting a = 1, we obtain the tractrix


$x = \operatorname{sech} u^1,\quad z = u^1 - \tanh u^1$


in the plane y = 0. Revolution about the z-axis yields the tractroid

$x = \operatorname{sech} u^1 \cos u^2,\quad y = \operatorname{sech} u^1 \sin u^2,\quad z = u^1 - \tanh u^1,$


which has a cuspidal edge where $u^1 = 0$. The pseudosphere is the horn-shaped
surface given by $u^1 \ge 0$. Differentiating the position vector
$\mathbf{r} = (x, y, z)$, we obtain


$g_{11} = \tanh^2 u^1,\quad g_{12} = 0,\quad g_{22} = \operatorname{sech}^2 u^1.$

Setting $g_1 = \tanh u^1$, $g_2 = \operatorname{sech} u^1$ in 20.17, we deduce

$-\tanh u^1\,\operatorname{sech} u^1\,K = (-\operatorname{sech} u^1)_1 = \operatorname{sech} u^1 \tanh u^1,$

whence K = -1.
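

The fundamental magnitudes just quoted can be checked without carrying out the differentiation by hand. The following sketch estimates the partial derivatives of the position vector by central differences and forms their scalar products; the function names and the step size are illustrative assumptions.

```python
import math

def tractroid(u1, u2):
    """Position vector (x, y, z) of the tractroid."""
    return (math.cos(u2) / math.cosh(u1),
            math.sin(u2) / math.cosh(u1),
            u1 - math.tanh(u1))

def first_fundamental_form(u1, u2, h=1e-5):
    """Central-difference estimates of g11, g12, g22 at the point (u1, u2)."""
    def partial(d1, d2):
        plus = tractroid(u1 + d1, u2 + d2)
        minus = tractroid(u1 - d1, u2 - d2)
        return tuple((a - b) / (2 * h) for a, b in zip(plus, minus))
    r1 = partial(h, 0.0)   # derivative with respect to u1
    r2 = partial(0.0, h)   # derivative with respect to u2
    dot = lambda a, b: sum(x * y for x, y in zip(a, b))
    return dot(r1, r1), dot(r1, r2), dot(r2, r2)
```

At any point with $u^1 > 0$ the three values should be close to $\tanh^2 u^1$, 0, and $\operatorname{sech}^2 u^1$.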


Since the pseudosphere has the same Gaussian curvature as the hyperbolic
plane, the geodesics on the former represent lines in the latter
isometrically. Among these geodesics are the meridians (for which $u^2$ is
constant), representing a pencil of parallels. Orthogonal to them, we find the
circles for which $u^1$ = constant, representing arcs of concentric horocycles
(§ 16.6). The longest of these horocyclic arcs is of length $2\pi$, since it is
represented by the circle


$x = \cos u^2,\quad y = \sin u^2$


in the plane z = 0 (where $u^1 = 0$). Thus the whole pseudosphere represents the
horocyclic sector bounded by this arc of length $2\pi$ and the diameters at its
two ends.
The horocyclic sector is wrapped round the pseudosphere so that the two 
diameters are brought together to form a single meridian. 

This limitation to a horocyclic sector renders the pseudosphere utterly 
useless as a means for drawing significant hyperbolic figures. Every geo- 
desic that is not merely a meridian winds itself round the “horn” as it pro- 
ceeds in one direction, whereas in the opposite direction it is abruptly cut 
off by the cuspidal edge. Thus we cannot even draw such a simple arrange- 
ment of lines as Figure 16.3a! These remarks are needed to counteract the 
widespread but mistaken idea that hyperbolic geometry can be identified 
with the intrinsic geometry of the pseudosphere. 


EXERCISE 


Use 20.23 to obtain an equation for the geodesics on the pseudosphere. 


21 


Topology of surfaces 


In Chapter 4 we considered various tessellations of the Euclidean plane 
(including, in § 4.6, regular tessellations). These may be regarded as infinite 
“maps.” In § 15.7 we considered the analogous tessellations of a sphere, 
which are finite maps. In § 10.3 we proved Euler’s formula 


$V - E + F = 2,$


which connects the numbers of vertices, edges (or arcs), and faces (or regions)
of any map drawn on a sphere. In §20.4 we extended this to


$V - E + F = \chi \le 2$


for a map on any closed surface, the Euler-Poincaré characteristic $\chi$ being
the same for all maps on the given surface. In § 6.9 we identified antipodal
points of a sphere so as to obtain the real projective plane; for a centrally
symmetrical map on the sphere, this identification naturally halves V, E,
and F, thus reducing $\chi$ from 2 to 1. In Figure 10.5a we considered
reciprocal polyhedra which, when regarded as spherical tessellations, are a
special case of dual maps. In § 10.1 we defined the Schläfli symbol {p, q},
which is appropriate for a map of p-gons, q at each vertex; and in 10.31 we
obtained the equations qV = 2E = pF. In § 15.4 we discussed groups of
permutations of the faces of a map. In § 15.3 we found that the theory of 
translations and glide reflections belongs to absolute geometry; that is, that 
it belongs not only to Euclidean geometry but also to hyperbolic geometry. 

The present chapter applies all these ideas to a discussion of the
topological properties of surfaces, including the conjecture of P. J. Heawood
that, for the coloring of any map on a surface of characteristic $\chi$,


$\left[\tfrac{7}{2} + \tfrac{1}{2}\sqrt{49 - 24\chi}\,\right]$


colors suffice. What makes this conjecture remarkable is that, although in
1890 he established its truth for every $\chi < 2$, it still remains an open
question for maps on the ordinary sphere or plane. Another conjecture is that
Heawood’s formula is “best possible” in the sense that, for each $\chi$, a map
requiring the full number of colors can be drawn.
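

Heawood’s expression is simple to evaluate exactly. The sketch below reads the square brackets as “integer part” and uses integer arithmetic throughout, so that no rounding error can arise; the function name is hypothetical.

```python
import math

def heawood_number(chi):
    """Integer part of 7/2 + (1/2) * sqrt(49 - 24*chi), for chi <= 2."""
    n = 49 - 24 * chi
    # floor((7 + sqrt(n)) / 2) equals (7 + floor(sqrt(n))) // 2 for integers n >= 0,
    # since (7 + sqrt(n)) / 2 >= m exactly when floor(sqrt(n)) >= 2m - 7.
    return (7 + math.isqrt(n)) // 2
```

For the torus ($\chi = 0$) the function returns 7, and for the projective plane ($\chi = 1$) it returns 6.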
21.1 ORIENTABLE SURFACES


The group of transformations of greatest importance in present-day 
mathematics, namely, the group of topological transformations, is far
wider than the projective group. Here we are dealing with a topo- 
logical space: that is, with a set of elements, call them points, for which 
the concept of a neighborhood is defined... . Any transformation 
preserving neighborhoods is called topological. 

Topology may be visualized as rubber-sheet geometry, since a topological
transformation permits any amount of stretching or compressing
(without tearing).


S. H. Gould [1, p. 304] 


In Chapter 5 we mentioned Klein’s famous classification of geometries 
according to the groups of transformations under which their theorems re- 
main true. In this sense, projective geometry is characterized by the group 
of collineations and correlations, and hyperbolic geometry by the subgroup 
of collineations leaving invariant a conic (the locus of points at infinity). 
Topology, sometimes described as “the most general of all geometries,” is 
characterized by the group of continuous transformations. For instance, 
since a polyhedron can be continuously transformed into the correspond- 
ing spherical tessellation, topology does not recognize any distinction be- 
tween the polyhedron and the tessellation. Again, in § 20.4 we defined the 
characteristic 


21.11   $\chi = V - E + F$


in terms of a map formed by geodesic arcs on the given surface, but the
value of $\chi$ will not change if we replace the geodesic arcs by any continuous
arcs which join the same pairs of points without crossing one another.
In other words, the edges of the map are not necessarily geodesics like the 
boundary between Colorado and Utah; they can just as well be “wild” like 
the boundary between Indiana and Kentucky. 

In the same spirit, the torus 8.88 is topologically equivalent to a sphere
with a handle (like the handle of a teacup), and we can derive more
complicated surfaces by adding any number of further handles. The operation of
adding a handle to a given surface reduces the value of $\chi$ by 2. For, since
the handle may be chosen in the form of a bent triangular prism joining
two triangular faces of a suitable map, as in Figure 21.1a, its insertion leaves
V unchanged, increases E by 3, and increases F by 3 - 2. Knowing that
a sphere has $\chi = 2$, we deduce that a sphere with p handles has


21.12   $\chi = 2 - 2p.$


This is called a surface of genus p. In particular, a sphere is a surface of
genus 0 and a torus is a surface of genus 1.


Figure 21.1a

From a rectangular rubber sheet, we can make a model of a torus by iden- 
tifying, or bringing together, each pair of opposite sides. The first identifi- 
cation produces a tube, and the second an “inner tube.” Conversely, by 
cutting a torus along two circles that have only one common point, we can 
unfold it (after some distortion) to make a rectangle whose pairs of oppo- 
site sides arise from the two cuts. More generally, given a surface of genus 
p, a suitable set of 2p cuts, all beginning and ending at a single point, en- 
ables us to unfold the surface into a 4p-gon whose pairs of opposite sides 
arise from the 2p cuts [Coxeter and Moser 1, p. 25], as in Figure 21.1b. 


Figure 21.1b (p = 2)


If we regard the 2p cuts on the surface as the 2p edges of a map, we find
that this map has one face and one vertex, in agreement with the formula


$\chi = V - E + F = 1 - 2p + 1 = 2 - 2p.$


When p = 1, the rectangle is conveniently taken to be a square, and this
square may be regarded as one of the infinitely many faces of the regular
tessellation {4, 4} (Figure 4.6a) or as the unit cell of a lattice generated by two
translations in perpendicular directions. The identification of opposite sides
may be achieved by identifying points in corresponding positions in all the
squares, that is, by pretending that the translations have no effect. In
technical language, the Euclidean plane is the universal covering surface of the
torus. Similarly, when p > 1, the 4p-gon is conveniently taken to be a regular
4p-gon of angle $\pi/2p$ in the hyperbolic plane, and this {4p} may be regarded
as a face of a regular hyperbolic tessellation {4p, 4p}. Opposite
sides of the {4p} are related by 2p translations, which generate a group having
the {4p} for a fundamental region. The infinite hyperbolic plane (which
is the universal covering surface) is reduced to the given finite surface by
identifying each point in one {4p} with the corresponding points in all the
other {4p}’s. The group of translations is called the fundamental group of
the surface [Coxeter and Moser 1, pp. 24-27, 58-60].


EXERCISE 


A graph is a set of points (called vertices), certain pairs of which are joined by arcs
(called edges). In particular, the vertices and edges of a map form a graph. Conversely,
any connected graph can be drawn on a surface so as to form a map covering the
surface.* A graph is said to be planar if it can be drawn on a sphere without any edges
crossing one another, in which case it can just as easily be drawn in the (inversive)
plane, provided we allow one face of the consequent map to be infinite. A vertex is
said to have valency (or “degree”) q if it belongs to q edges. A graph is said to be
trivalent if every vertex belongs to three edges. In this case 3V = 2E; therefore V is even.
The Thomsen graph† has six vertices $P_1, \dots, P_6$ and nine edges $P_iP_j$, where i + j is
odd. This is the simplest nonplanar trivalent graph. Can it be drawn on a torus?

* J. H. Lindsay, Jr., Elementary treatment of the imbedding of a graph in a surface,
American Mathematical Monthly, 66 (1959), pp. 117-118.
† W. Blaschke and G. Bol, Geometrie der Gewebe (Berlin, 1938), p. 35.


21.2 NONORIENTABLE SURFACES


A surface is non-orientable if and only if there exists on the surface
some closed curve .. . such that a small oriented circle whose center
traverses the curve continuously will arrive at its starting point with
its orientation reversed.


Hilbert and Cohn-Vossen
[1, p. 306]


Each of the surfaces discussed in § 21.1 is orientable, that is, a positive
sense of rotation can be defined consistently everywhere. More precisely,
the faces of any map on the surface can be regarded as directed polygons
in such a way that the two directions thus assigned to each edge disagree,
or cancel out. A surface is said to be nonorientable if it admits one map
which cannot be oriented in this manner. The most famous instance is the
Möbius strip, which can be illustrated by taking a strip of paper ABAB,
several times longer than it is wide, and sticking the two ends together after
twisting one of them by a half-turn. Its nonorientability can be checked
by means of a map consisting of a single row of squares. It is one-sided in
the sense that an ant could crawl along the whole length of the strip,
without crossing the bounding edge, and find himself at the starting point on
the “other side.” If two wheels in a machine are connected by a belt of
such a shape (e.g., for the purpose of conveying hot or abrasive materials),
the substance of the belt will wear out equally on both sides. A patent for
this practical application of the Möbius strip has been acquired by the
Goodrich Company.*


Unlike the closed surfaces considered in § 21.1, the Möbius strip is
bounded. The boundary is a simple closed curve, but physically it cannot
be shrunk away because, if we make it coincide with the circumference
of a circle, the interior of the circle must intersect the surface of the strip.
This practical difficulty arises because the model is embedded in Euclidean
space. Theoretically, no such embedding is needed. When the boundary
has been shrunk away, the resulting closed surface is topologically a real
projective plane! In other words, the Möbius strip is the real projective plane
with a hole cut out of it. For we may regard the projective plane (§ 6.9) as
a sphere with antipodal points identified. When cutting out a circular hole
round the north pole we must, of course, also cut out an equal hole round
the south pole. What remains of the sphere is a zone bounded by two parallels
of latitude such as the Tropics of Cancer and Capricorn. But the
identification of antipodes has the effect that only half the zone is needed, say
the “visible” half (Figure 21.2a). This half-zone, with its ends AB
identified, is evidently a Möbius strip.


Figure 21.2a 


Instead of a whole sphere with every pair of antipodal points identified, 
we may regard the projective plane as a hemisphere (say the “southern” 
hemisphere) with identification of diametrically opposite points on the pe- 
ripheral equator. In the spirit of topology, the hemispherical surface can 
be stretched until it covers almost the whole sphere, and the periphery (with 
opposite points identified) is reduced to a very small circle round the north 
pole. In other words, the projective plane is topologically equivalent to a 
sphere with a cross-cap, which may be described as a small circular hole having
the magic property that, as soon as the crawling ant reaches it, he finds
himself leaving the same hole from its diametrically opposite point (inside,
instead of outside, the sphere).


* U.S. Patents 1,442,632 (1923), 2,479,929 (1949), 2,784,834 (1957).


Figure 21.2b


We can derive more complicated surfaces by adding any number of cross-caps,
each of which reduces the value of $\chi$ by 1. For, if a map on a given
surface has a vertex A belonging to three faces, we can replace A by a
cross-cap ABCABC as in Figure 21.2b. Since this requires the formation of two
new vertices B, C, and three new edges BC, CA, AB, the insertion of the
cross-cap increases V by 2, E by 3, and leaves F unchanged. (The faces on
the left and right now meet twice: along the original edge through A and
again along the new edge BC.) Knowing that a sphere has $\chi = 2$, we
deduce that a sphere with q cross-caps has


21.21   $\chi = 2 - q.$


When q = 1 this is, as we have seen, the real projective plane. When q = 2
it is the Klein bottle (or “nonorientable torus”) [Hilbert and Cohn-Vossen
1, p. 308].

A suitable set of q cuts, all beginning and ending at a single point and 
each passing through a different cross-cap, enables us to unfold the surface 
into a 2q-gon such that q pairs of adjacent sides arise from the q cuts [Coxe- 
ter and Moser 1, pp. 25-28, 56-58] as in Figure 21.2c. These cuts are the 
q edges of a map having one face and one vertex, in agreement with the 
formula 


$\chi = V - E + F = 1 - q + 1 = 2 - q.$
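

The bookkeeping for handles and cross-caps can be summarized in a few lines. In the sketch below (all names hypothetical) the first two functions use the one-vertex, one-face polygon models, while the last two record the changes in V, E, F described in § 21.1 and in this section.

```python
def chi_orientable(p):
    """4p-gon model of a sphere with p handles: one vertex, 2p edges, one face."""
    V, E, F = 1, 2 * p, 1
    return V - E + F

def chi_nonorientable(q):
    """2q-gon model of a sphere with q cross-caps: one vertex, q edges, one face."""
    V, E, F = 1, q, 1
    return V - E + F

def add_handle(V, E, F):
    """A triangular-prism handle leaves V unchanged, adds 3 edges and 3 - 2 faces."""
    return V, E + 3, F + 3 - 2

def add_cross_cap(V, E, F):
    """A cross-cap replacing a vertex adds 2 vertices and 3 edges and no faces."""
    return V + 2, E + 3, F
```

Applying `add_handle` to any map lowers V - E + F by 2, and applying `add_cross_cap` lowers it by 1, in agreement with 21.12 and 21.21.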


Figure 21.2c 


When q = 1, the 2q-gon is a digon which may be regarded as one of the
two faces of the spherical tessellation {2, 2} (see Ex. 1 at the end of § 15.7).
In fact, the universal covering surface of the projective plane is the sphere,
and its fundamental group is of order 2, generated by the central inversion.

When q = 2, the 2q-gon may be regarded as a face of the Euclidean
tessellation {4, 4}, so that the universal covering surface is the Euclidean plane.
Unlike the torus, whose fundamental group p1 is generated by two
translations, the Klein bottle has the fundamental group pg, generated by two
glide reflections (see §4.3 and Plate I). Similarly, when q > 2, so that the
universal covering surface is the hyperbolic plane, the 2q-gon is a face of
the hyperbolic tessellation {2q, 2q}, and the fundamental group is
generated by q glide reflections [Coxeter and Moser 1, pp. 56-58].


EXERCISES 


1. The projective plane is topologically equivalent to a disk* with diametrically 
opposite points identified. 

2. How can the Thomsen graph (see the end of §21.1) be drawn on a sphere with 
a cross-cap (or a disk with opposite points identified)? 

3. What happens to the vertices and edges of a regular hexagonal prism when we 
project it centrally onto its circumsphere and then identify antipodes? 

4. Is a sphere with p handles and q cross-caps topologically equivalent to a sphere
with 2p + q cross-caps? 


21.3 REGULAR MAPS 


We first give a method of reducing any two-dimensional manifold to 
one of the known polygonal normal forms. The method used is one 
by which a polygon on which the manifold is represented is subjected 
to a series of transformations by cutting it apart in a simple manner
and then joining it together again so as to obtain a new polygon
representing the same manifold.


H. R. Brahana (1895 - )
(Annals of Mathematics, 23 (1921), p. 144) 


It can be proved† that every closed surface is topologically equivalent
either to a sphere with p ($\ge 0$) handles (if the surface is orientable) or to a
sphere with q (> 0) cross-caps. In virtue of 21.12 and 21.21, this means
that, from the standpoint of topology, there is just one orientable closed
surface for each of the values


$\chi = 2,\ 0,\ -2,\ -4,\ \dots,$


namely a sphere with $1 - \tfrac12\chi$ handles, and there is just one nonorientable
closed surface for each of the values

$\chi = 1,\ 0,\ -1,\ -2,\ \dots,$

namely, a sphere with $2 - \chi$ cross-caps. We have already described, on
such a surface, a very simple map having one vertex, $2 - \chi$ edges, and one
face. In the orientable case, this map is “regular” in the following sense.

* A disk is a circle plus its interior. For other topological properties of the disk and sphere,
see A. W. Tucker, Proceedings of the First Canadian Mathematical Congress (University of
Toronto Press, 1946).

† Brahana’s original proof has been simplified by Lefschetz [1, pp. 72-85] and others. One of
the best expositions is by R. C. James, Combinatorial topology of surfaces, Mathematics
Magazine, 29 (1955), pp. 1-39.

The vertices, edges, and faces of a map (on any closed surface, orientable 
or nonorientable) may conveniently be called the elements of the map. Those 
permutations of the elements which preserve all the relations of incidence 
are called automorphisms of the map. The automorphisms form a group 
(of order 1 or more) called the group of the map. This is a natural 
generalization of the symmetry group of a polyhedron or tessellation (§ 15.7), but 
metrical ideas are no longer used. A map is said to be regular if its 
automorphisms include the cyclic permutation of the edges (and vertices) 
belonging to any one face and also the cyclic permutation of the edges (and 
faces) that meet at any one vertex of this face. Such a map is “of type 
{p, q}” if p edges belong to a face, and q to a vertex. The dual map, whose 
edges cross those of the original map, is of type {q, p}. (The letters p and q 
used here have no connection with our previous use of p and q for the num- 
bers of handles and cross-caps.) 

The equations 10.31 remain valid. Combining them with 21.11, we obtain 
a generalization of 10.32: 

21.31    $V = 2pr, \quad E = pqr, \quad F = 2qr,$

where, if χ ≠ 0, 

21.32    $r = \dfrac{\chi}{2p + 2q - pq}.$

If χ = 0, so that 2p + 2q = pq as in §4.6, there are infinitely many possible 
values for r, as we shall soon see. 
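
The formulas 21.31 and 21.32 are easy to check numerically. The following Python sketch is an illustration only: the name `regular_map_counts` is ours, and the function merely evaluates the two formulas; it does not decide whether a regular map of the given type really exists on the surface.

```python
from fractions import Fraction

def regular_map_counts(p, q, chi):
    """Return (V, E, F) for type {p, q} and characteristic chi, by 21.31 and 21.32."""
    denom = 2 * p + 2 * q - p * q
    if chi == 0 or denom == 0:
        raise ValueError("21.32 needs chi != 0 and 2p + 2q != pq")
    r = Fraction(chi, denom)          # 21.32, kept exact
    return 2 * p * r, p * q * r, 2 * q * r

print(*regular_map_counts(4, 3, 2))   # cube on the sphere: 8 12 6
print(*regular_map_counts(4, 3, 1))   # same type with chi = 1: 4 6 3
```

Exact fractions are used because r need not be an integer (here r = 1/2 when χ = 1), although V, E, F must be.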
If χ = 1 or 2, the possible values for p and q are given by 10.33 without 
the restrictions p > 2, q > 2. Thus the regular maps on a sphere (χ = 2) 
are just the spherical tessellations 


            {p, 2},  {2, p},  {3, 3}, 
21.33       {4, 3},  {3, 4},  {5, 3},  {3, 5}, 


namely: the dihedron whose p vertices are evenly spaced along the equa- 
tor, the hosohedron* whose edges and faces are p meridians and p lunes, and 
“blown-up” variants of the five Platonic solids. All these are centrally sym- 
metrical, except the dihedron and its dual with p odd, and the tetrahedron 
{3, 3}. In the centrally symmetrical cases we can identify antipodes to 
obtain the regular tessellations of the elliptic plane (χ = 1): 


            {p, 2}/2 and {2, p}/2  (p even), 
21.34       {4, 3}/2,  {3, 4}/2,  {5, 3}/2,  {3, 5}/2 

[Coxeter and Moser 1, p. 111]. For instance, identifying opposite elements 
of the cube (Figure 10.5b), we see that {4, 3}/2 is a partition of the elliptic 
plane (or of the real projective plane) into three “squares,” say ABCD, 
ACDB, ADBC. (These squares are just the three handkerchiefs which Lady 
Muriel began to sew together in her attempt to make the Purse of 
Fortunatus [Dodgson 4, pp. 100-104]. The first two, joined along their common 
side CD, form a Möbius strip whose boundary is ADBC.) Likewise, {5, 3}/2 
is a partition of the elliptic plane into six pentagons, each of which is 
surrounded by the remaining five. 

* This term (literally “any number of faces”) was coined by Vito Caravelli (1724-1800), whose 
Traité des hosoèdres was published in Paris (1959) by the Librairie Scientifique et Technique. 


The regular maps on the torus are derived from infinite regular maps on 
its universal covering surface, which is the Euclidean plane. As we saw in 
§ 4.6, these infinite maps are the regular tessellations 


21.35 {6,3}, {4,4}, {3, 6}. 


The necessary identifications are determined by subgroups of the translation 
groups of these tessellations. 

The vertices of {4, 4} may be taken to be the lattice of points whose 
Cartesian coordinates (x, y) are integers. The torus is derived by identifying 
opposite sides of a square, one of whose sides goes from (0, 0) to (b, c), where 
b and c are positive integers or zero (but not both zero). Since the area of 
this square is b² + c², the part of the original {4, 4} that lies inside it 
consists of b² + c² unit squares. We thus find, on the torus, a map 

    $\{4, 4\}_{b,c}$

in which 

    $V = b^2 + c^2, \quad E = 2V, \quad F = V$


[Coxeter and Moser 1, p. 103]. In particular, {4, 4}_{1,0} is the map (having 
one vertex and one face) which we used in § 21.1 when we unfolded the torus 
after cutting it along the two edges of the map. (If it seems paradoxical for 
a map of type {4, 4} to have only one vertex, we must recognize that the 
face is still quadrangular even though its four vertices all coincide with the 
single vertex of the map.) The map {4, 4}_{2,1}, whose five faces are each 
surrounded by the remaining four, is shown in Figure 21.3a. 




Figure 21.3a        Figure 21.3b 

Similarly, the vertices of {3, 6} may be taken to be the lattice of points 
whose oblique coordinates, with axes inclined at 60°, are integers. The torus 
is derived by identifying opposite sides of a rhombus of angle 60°, one of 
whose sides goes from (0, 0) to (b, c). Since the area of this rhombus is 
b² + bc + c² times that of the unit cell of the lattice (consisting of two 
adjacent faces of {3, 6}), the part of {3, 6} that lies inside it consists of 
2(b² + bc + c²) equilateral triangles. We thus find, on the torus, a map 

    $\{3, 6\}_{b,c}$

in which 

    $V = b^2 + bc + c^2, \quad E = 3V, \quad F = 2V$


[Coxeter and Moser 1, p. 107]. (For an affine variant of one-half of {3, 6}_{2,1}, 
see Figure 13.5c.) The dual map 

    $\{6, 3\}_{b,c}$

has b² + bc + c² hexagonal faces. In particular, {6, 3}_{2,1} (Figure 21.3b)* 
is Heawood’s partition of the torus into seven hexagons, each of which is 
surrounded by the remaining six. 
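
The vertex counts b² + c² and b² + bc + c² can be confirmed by counting lattice points directly. The Python sketch below is our own illustration (all function names are hypothetical). It counts the integer points in the half-open parallelogram spanned by the side (b, c) and its image under a quarter-turn (for {4, 4}) or a turn through 60° (for {3, 6}); in oblique coordinates with axes at 60°, the latter sends (x, y) to (−y, x + y).

```python
def lattice_points_in_cell(u, v):
    """Count integer points s*u + t*v with 0 <= s < 1 and 0 <= t < 1."""
    (ux, uy), (vx, vy) = u, v
    det = ux * vy - uy * vx
    if det <= 0:
        raise ValueError("u, v must be independent and positively oriented")
    xs = [0, ux, vx, ux + vx]
    ys = [0, uy, vy, uy + vy]
    count = 0
    for x in range(min(xs), max(xs) + 1):
        for y in range(min(ys), max(ys) + 1):
            s_num = x * vy - y * vx     # equals s * det
            t_num = ux * y - uy * x     # equals t * det
            if 0 <= s_num < det and 0 <= t_num < det:
                count += 1
    return count

def square_torus_vertices(b, c):
    # {4, 4}_{b,c}: the second side is (b, c) turned through 90 degrees.
    return lattice_points_in_cell((b, c), (-c, b))

def triangular_torus_vertices(b, c):
    # {3, 6}_{b,c}: the second side is (b, c) turned through 60 degrees.
    return lattice_points_in_cell((b, c), (-c, b + c))

print(square_torus_vertices(2, 1), triangular_torus_vertices(2, 1))
```

Working with the integer numerators s·det and t·det avoids any rounding, so the boundary points are assigned to exactly one copy of the cell.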

Thus we see that the torus admits infinitely many regular maps of each 
of the three types 21.35. On the other hand, there are no regular maps on 
the Klein bottle [Coxeter and Moser 1, p. 116]. 


If a regular map has more than one vertex and more than one face, every 
edge joins two vertices and separates two faces. If there is an automorphism 
which interchanges these two vertices without interchanging the two faces 
(in which case there is another automorphism which does vice versa), the 
map is said to be reflexible [Ball 1, p. 129; Coxeter and Moser 1, p. 101]. 
Clearly, all the regular maps on the sphere and all those on any 
nonorientable surface are reflexible, but those on the torus are reflexible only if 
bc(b − c) = 0. It was suggested by Coxeter and Moser [1, p. 102] that 
possibly all regular maps on more complicated surfaces (i.e., on surfaces of 
negative characteristic) are reflexible. However, this conjecture is refuted 
by J. R. Edmonds’ discovery of a nonreflexible regular map† of type {7, 7} 
and genus 7, having 8 vertices, 28 edges, and 8 heptagonal faces 


FDCGBEH, GEDACFH, AFEBDGH, BGFCEAH, CAGDFBH, 
DBAEGCH, ECBFADH, ABCDEFG. 
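
The numerical data of Edmonds’ map can be read off from the eight face-cycles. The following Python sketch (ours, not part of the original account) counts vertices and edges, checks that every edge lies on exactly two faces, and computes the genus from χ = 2 − 2 × genus, taking orientability for granted. It does not test regularity or reflexibility.

```python
from collections import Counter
from itertools import combinations

FACES = ["FDCGBEH", "GEDACFH", "AFEBDGH", "BGFCEAH",
         "CAGDFBH", "DBAEGCH", "ECBFADH", "ABCDEFG"]

def face_edges(face):
    """Unordered edges met in one circuit of a face given as a vertex cycle."""
    n = len(face)
    return [frozenset((face[i], face[(i + 1) % n])) for i in range(n)]

edge_sides = Counter(e for f in FACES for e in face_edges(f))
vertices = set("".join(FACES))

V, E, F = len(vertices), len(edge_sides), len(FACES)
chi = V - E + F
genus = (2 - chi) // 2      # assumes the surface is orientable

every_edge_twice = all(n == 2 for n in edge_sides.values())
complete = set(edge_sides) == {frozenset(pair)
                               for pair in combinations(vertices, 2)}
print(V, E, F, chi, genus, every_edge_twice, complete)
```

The last flag reports whether every two of the eight vertices are joined by an edge.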
EXERCISES 
1. Describe the maps {2, 1} and {1,2} on the sphere. (The former has one face, 
a digon {2}; the latter has two faces which are monogons {1}.) 
2. The elliptic tessellation {2q, 2}/2 has q vertices and q edges, all on one line, 
and one face. Its dual, {2, 2q}/2, has one vertex, q complete lines for its edges, and q 
angular regions for its faces. Describe the remaining regular tessellations of the elliptic 
plane. 

* For other ways of drawing Heawood’s map, see Coxeter, Map-coloring problems, Scripta 
Mathematica, 23 (1957), pp. 19-21, and The four-color map problem, 1840-1890, Mathematics 
Teacher, 52 (1959), pp. 288-289. 

† See also Robert Frucht, Canadian Journal of Mathematics, 4 (1952), p. 247. 

3. As we have seen, the standard decomposition of the torus provides a one-faced 
map {4, 4}_{1,0}. The standard decomposition of the Klein bottle (as in the second part 
of Figure 21.2c) provides another one-faced map of type {4, 4}, but this is not regular. 
In both cases, the one vertex and the two edges form a very simple graph, which may 
be described roughly as a figure of eight. The same graph can be drawn on the pro- 
jective plane to form {2, 4}/2, or on the sphere to form an irregular map whose faces 
consist of a digon and two monogons. 

4. The standard decomposition of a sphere with three cross-caps (as in the third 
part of Figure 21.2c) provides an irregular one-faced map of type {6, 6}. Its one ver- 
tex and three edges form a “clover-leaf” having three loops. The same graph can be 
drawn on the Klein bottle as an irregular map of type {3, 6}, on the torus as {3, 6}_{1,0}, 
on the projective plane as {2, 6}/2, and on the sphere as an irregular map whose faces 
consist of a triangle and three monogons. 

5. Describe the reflexible maps 


    {4, 4}_{1,1},  {4, 4}_{2,0},  {3, 6}_{1,1},  {6, 3}_{1,1},  {6, 3}_{2,0}. 


6. The vertices and edges of {6, 3}_{1,1} form the Thomsen graph (see the end of 
§ 21.1). Those of {4, 4}_{2,2} form an analogous graph having eight vertices instead of six. 

7. A graph is called a complete V-point if every two of its V vertices are joined by 
an edge. The vertices and edges of the following maps form complete V-points (for 
which values of V?): 


    {3, 2},  {3, 3},  {4, 3}/2,  {4, 4}_{2,1},  {3, 5}/2,  {3, 6}_{2,1}. 


8. The vertices and edges of Edmonds’ map form a complete 8-point. 
9. There is no map of type {1, 1}. (Hint: Set p = q = 1 in 21.31 and 21.32.) 


21.4 THE FOUR-COLOR PROBLEM 


“I doubt it,” said the Carpenter, 
And shed a bitter tear. 


Lewis Carroll 
[Dodgson 2, Chap. 4] 


The theory of maps on surfaces may be said to have begun in 1840, when 
Möbius puzzled his students with the problem of dividing a country into five 
districts in such a way that every two would have a common boundary line 
(not merely a common point). The impossibility of such a partition led 
naturally to the question whether four colors always suffice for coloring a map 
when we stipulate that different colors are needed wherever two districts 
share a boundary line or, in mathematical terms, wherever two faces share 
an edge. It must be emphasized that each face is simply connected (i.e., 
topologically a disk). Thus the geographical problem applies to a single 
island or continent: the ocean and all the other islands and continents are 
to be taken together as forming one more face, and if this is a blue face the 
same color must be allowed for some of the other faces. For instance, in 
a map of Europe, we need three different colors (say green, red, and yellow) 
for Belgium, France, and Germany; Holland may have the same color as 
France, but Luxembourg must be blue, like the sea. 

Figures 15.4a, b, c illustrate the use of four colors for the tetrahedron and 
octahedron, and five for the icosahedron. For the tetrahedron, four is the 
only possible number, since each face meets all the others. Apart from this 
simplest case, any map whose faces are triangles can be colored in three colors.* 
Moreover, any map having an even number of faces at each vertex (such 
as the octahedron) can be colored in two colors, like a chessboard. 


The problem of deciding whether four colors suffice for coloring any map 
on a plane or a sphere is sometimes called Guthrie's problem, after Francis 
Guthrie, who took his B.A. in London in 1850 and his LL.D. in 1852. Be- 
tween these dates the problem occurred to him while he was coloring a map 
of England. He tried in vain to prove that four colors are always sufficient. 
On October 23, 1852, his younger brother Frederick communicated the con- 
jecture to Augustus De Morgan (author of A Budget of Paradoxes). In 1878, 
Cayley revived interest in the problem at a meeting of the London Mathe- 
matical Society by asking whether anyone had proved the conjecture. In 
1880, Cayley’s challenge was answered by A. B. Kempe and P. G. Tait, who 
published plausible arguments, which were accepted for ten years (even by 
Klein himself), as proving that four colors will always suffice. In 1890, 
Heawood drew attention to the fallacy in Kempe’s argument, using for a 
counterexample a particular map having 18 faces. The number of faces can 
actually be reduced to 9, so as to reveal the fallacy more quickly.† 

In §§ 21.5-21.7 we shall describe the valid part of Kempe’s work and also 
Heawood’s extension to maps on the torus and other multiply connected 
surfaces. 


EXERCISES 


1. In how many essentially different ways can a cube be colored with three given 
colors, a dodecahedron with four? 

2. In Figure 15.4c, the icosahedron is colored with five colors so that each face and 
its three neighbors have four different colors. Replace each “e” by the one remaining 
color, thus reducing the number of colors to four. Starting afresh, color the icosahe- 
dron with three colors [Ball 1, pp. 238-241]. 

3. Try to draw a map that is difficult to color with four colors. 


* R. L. Brooks, On coloring the nodes of a network, Proceedings of the Cambridge Philosophical 
Society, 37 (1941), pp. 194-197. 


21.5 THE SIX-COLOR THEOREM 


Even the moonlit track ahead of him faded from his consciousness, for 
into his head had come a theorem which might be true or might be 
false, and his mind darted hither and thither seeking proofs to establish 
its truth and counter-examples to show that it could not possibly 
be true. 


J. L. Synge [2, p. 165] 


The equations 10.31, which apply to a regular map of type {p, q}, remain 
valid for a general map having various kinds of face and various numbers of 
faces at a vertex, provided we interpret p as the average number of vertices 
(or edges) of a face, and q as the average number of faces (or edges) at a 
vertex. Since a vertex belonging to only two edges can be omitted by com- 
bining the two edges, there is no real loss of generality in assuming that every 
vertex belongs to at least three edges. Thus q ≥ 3 and 

    $2E = qV \ge 3V,$

whence, by 21.11, E ≤ 3(E − V) = 3(F − χ) and 

21.51    $p = \dfrac{2E}{F} \le 6\left(1 - \dfrac{\chi}{F}\right).$

This proves that p < 6 whenever χ > 0, that is, for the sphere (χ = 2) or 
the projective plane (χ = 1). Hence 


21.52 Every map on the sphere or the projective plane has at least one face 
whose number of edges is less than 6. 


We can now prove, by induction over the number of faces, 


THE SIX-COLOR THEOREM. To color any map on the sphere or the projective 
plane requires at most six colors. 


We take “any map” to mean “any map having F faces,” for each particular 
value of F. When F ≤ 6 there is no problem: we can assign distinct 
colors to all the faces. The theorem will be proved if we can deduce the 
case of 7 faces from the case of 6, and then 8 from 7, and so on. Accordingly, 
we make the inductive assumption that the theorem holds for every 
map of F − 1 faces, and then proceed to investigate a given F-faced map, 
paying particular attention to the face (or one of the faces) having 5 or fewer 
edges (see 21.52). For definiteness, we assume this face to be a pentagon, 
like the shaded face in Figure 21.5a. (The same arguments can be carried 
through with trivial changes if it is a quadrangle, triangle, or digon.) 
Figure 21.5b shows a modified map in which this pentagonal face has shrunk 
to a point, that is, in which its territory has been ceded to its five 
neighbors. By the inductive assumption, the modified map, having only F − 1 
faces, can be colored with six colors. Let this be done, and let the same 
coloring be applied to the original map. Then, even if the five neighbors 
need five distinct colors, there is still a sixth color left for the pentagonal 
face itself. 

Figure 21.5a        Figure 21.5b 

Since this argument can be applied with F = 7, then with F = 8, and so 
on, the six-color theorem holds for all values of F. 
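
The induction can be turned into a coloring procedure. The Python sketch below is a graph-theoretic variant, not the face-shrinking construction used in the proof: it works on the face-adjacency graph (our assumption about how the map is supplied), repeatedly sets aside a face with at most five remaining neighbors, and then colors the faces in the reverse order. The names `six_color` and `is_proper` are hypothetical.

```python
def six_color(adjacency):
    """Color faces 0..5 so that neighboring faces differ.

    `adjacency` maps each face to the set of faces sharing an edge with it.
    Every stage must offer a face with at most 5 remaining neighbors,
    as 21.52 provides for maps on the sphere or projective plane.
    """
    remaining = {f: set(nbrs) for f, nbrs in adjacency.items()}
    removal_order = []
    while remaining:
        face = min(remaining, key=lambda f: len(remaining[f]))
        if len(remaining[face]) > 5:
            raise ValueError("no face with 5 or fewer neighbors")
        removal_order.append(face)
        for nbr in remaining.pop(face):
            remaining[nbr].discard(face)
    coloring = {}
    for face in reversed(removal_order):
        used = {coloring[n] for n in adjacency[face] if n in coloring}
        coloring[face] = next(c for c in range(6) if c not in used)
    return coloring

def is_proper(adjacency, coloring):
    return all(coloring[f] != coloring[n]
               for f in adjacency for n in adjacency[f])

# Faces of a cube: face i is adjacent to all others except its opposite, 5 - i.
cube = {i: {j for j in range(6) if j not in (i, 5 - i)} for i in range(6)}
# Six mutually adjacent faces, as in {5, 3}/2 on the projective plane.
six_pentagons = {i: set(range(6)) - {i} for i in range(6)}

print(is_proper(cube, six_color(cube)))
print(len(set(six_color(six_pentagons).values())))
```

When a face is restored, its already colored neighbors are exactly those that remained when it was set aside, so at most five colors are excluded.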


Can the number 6 be replaced by 5? For the projective plane it cannot, 
as we shall soon see. For the sphere it can, by a subtler argument depend- 
ing on the topological theorem that a circle on the sphere decomposes it 
into two separate regions [Ball 1, p. 229]. But the gap between the five colors 
that are always sufficient and the four that are usually necessary has never 
been bridged. Heawood himself continued to investigate the problem for 
the rest of his life, reducing it to pure algebra. Other authors have gradu- 
ally increased the lower bound of the number of faces for a map that might 
possibly require five colors. 

It is almost certainly a mere coincidence that the numbers 4 and 5 play an analogous role in 
arithmetic. According to Mordell [1, p. 19], it is “very easy to prove that every integer is the 
sum of at most five integer cubes, positive or negative, and there is an unproved conjecture that 
four cubes suffice.” 


For the projective plane, on the other hand, there is no gap to be bridged: 
six colors are both necessary and sufficient. The simplest map that needs 
all six is {5, 3}/2 (see p. 387), which is drawn in Figure 21.5c as a disk with 
diametrically opposite points identified. By cutting out a hole round the 
vertex A, we obtain H. Tietze’s six-color map on the Möbius strip (Figure 
21.5d). Here, as on the whole projective plane, six colors are both 
necessary and sufficient. 

Figure 21.5c 


Figure 21.5d 


EXERCISE 


Make a model of the Möbius strip and color it as indicated in Figure 21.5d. (Since 
this is a “one-sided” surface, the paper must everywhere have the same color on both 
sides.) 


21.6 A SUFFICIENT NUMBER OF COLORS FOR ANY SURFACE 


[Huck Finn to Tom Sawyer in their flying boat:] “We’re right over 
Illinois yet. And you can see for yourself that Indiana ain’t in sight. . . . 
Illinois is green, Indiana is pink. You show me any pink down here, 
if you can. No, sir; it’s green.” 

“Indiana pink? Why, what a lie!” 


“It ain’t no lie; I’ve seen it on the map, and it’s pink.” 


Mark Twain (= S. L. Clemens, 1835-1910) 
(Tom Sawyer Abroad, Harper, New York, 1896, Chap. 3) 


The problem of coloring maps on a more complicated surface is not dif- 
ficult, as on the sphere, but easy, as on the projective plane. In fact, we 
can now prove 


HEAWOOD’S THEOREM. To color any map on a surface of characteristic 
χ < 2 requires at most [N] colors, where 

    $N = \dfrac{7 + \sqrt{49 - 24\chi}}{2}.$


Since the case χ = 1 has already been proved in § 21.5, we shall suppose 
that 

    $\chi \le 0.$

Since the theorem is obviously true when F ≤ N (which implies F ≤ [N]), we 
shall suppose also that 

    $F > N$

and use induction over the number of faces, assuming that [N] colors suffice 
for any map having F − 1 faces. Since N satisfies the quadratic equation 

    $N^2 - 7N + 6\chi = 0$

or 

    $6\left(1 - \dfrac{\chi}{N}\right) = N - 1,$

the inequality 21.51 yields 

    $p \le 6\left(1 - \dfrac{\chi}{F}\right) \le 6\left(1 - \dfrac{\chi}{N}\right) = N - 1.$

Hence there is at least one face having [N] − 1 or fewer edges (cf. 21.52). 
We continue as in § 21.5, using [N] instead of 6, and conclude that [N] colors 
suffice for the given map. 


Although this proof would break down if χ were positive, we note that 
Heawood’s expression for N not only yields the correct value 6 when χ = 1 
but also yields the conjectured value 4 when χ = 2. 
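
Heawood’s number [N] can be evaluated in exact integer arithmetic. The sketch below is ours (the name `heawood_number` is hypothetical). It relies on the fact that the integer part of (7 + √m)/2 equals (7 + [√m]) // 2 for a nonnegative integer m, so that no floating-point square root is needed.

```python
from math import isqrt

def heawood_number(chi):
    """Integer part [N] of N = (7 + sqrt(49 - 24*chi)) / 2, for chi <= 2."""
    if chi > 2:
        raise ValueError("a closed surface has characteristic at most 2")
    return (7 + isqrt(49 - 24 * chi)) // 2

for chi in (2, 1, 0):
    print(chi, heawood_number(chi))
```

For χ = 2, 1, 0 this returns 4, 6, 7, in agreement with the values quoted in the text.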


EXERCISE 
Tabulate [N] for values of χ from 2 down to −9. 


21.7 SURFACES THAT NEED THE FULL NUMBER OF COLORS 


“Suppose there's a brown calf and a big brown dog, and an artist is 
making a picture of them.... He has got to paint them so you can 
tell them apart the minute you look at them, hain't he? Of course. 
Well, then, do you want him to go and paint both of them brown? 
Certainly you don’t. He paints one of them blue, and then you can’t 
make no mistake. It’s just the same with maps. That’s why they make 
every state a different color. . . .” 


Mark Twain (ibid.) 


Heawood’s theorem (with χ = 0) tells us that every map on the torus can 
be colored with seven colors. His regular map {6, 3}_{2,1}, whose seven faces 
all meet one another, shows that at least one map on the torus really needs 
seven colors. Since Heawood’s expression for N depends only on χ, it yields 
the same number 7 for the Klein bottle. However, Philip Franklin has 
proved a six-color theorem for the Klein bottle.* 

Ringel [1, p. 124] proved that the Klein bottle is the only nonorientable 
surface not needing as many as [N] colors. For instance, inserting a 
cross-cap at one vertex of Heawood’s seven-faced map on the torus, as in 
Figure 21.2b, we obtain a seven-faced map on the surface of characteristic 
−1. This map still needs seven colors, since all its faces meet one another. 
(In fact, some pairs of faces meet twice.) 


* Coxeter, Scripta Mathematica, 23 (1957), pp. 21-23. 

In the case of an orientable surface of genus p = 1 − ½χ, Heawood’s 
number [N] is given by 

    $N = \dfrac{7 + \sqrt{48p + 1}}{2},$

and it is appropriate to give the name “The Heawood conjecture” to the 
statement that, for every p, the orientable surface of genus p carries a map 
of [N] faces, all meeting one another. This conjecture became a theorem in 
1968, when Ringel and Youngs* found such a map for every p. This break- 
through was the climax of a long story. The investigation was begun by L. 
Heffter in 1891 and was then neglected until 1952, when Ringel resumed it. 
He collaborated with Youngs from 1966 on, and some helpful ideas were 
contributed by W. Gustin, C. M. Terry, and L. R. Welch. The values of p 
up to 32 were disposed of independently by Jean Mayer (a professor of 
French literature) in 1967. The four most difficult cases (p = 59, 83, 158, 
257) were achieved by Youngs and Richard Guy in 1968. 
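
For the record, the genus form of Heawood’s number follows from the expression in § 21.6 by the substitution χ = 2 − 2p, which is just the relation p = 1 − ½χ solved for χ. A short worked version, in our own layout:

```latex
\begin{align*}
N &= \frac{7 + \sqrt{49 - 24\chi}}{2}, \qquad \chi = 2 - 2p, \\
49 - 24\chi &= 49 - 24(2 - 2p) = 48p + 1, \\
N &= \frac{7 + \sqrt{48p + 1}}{2}.
\end{align*}
```

Thus p = 1 gives N = 7 (the torus), and p = 6 gives 48p + 1 = 289 = 17², so that N = 12 exactly.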


EXERCISES 
1. Replacing the “hole” in Figure 21.5c by a cross-cap, obtain a six-color map (of 3 
pentagons and 3 heptagons) on the Klein bottle. 
2. Draw an eight-color map on the surface of genus two [Ball 1, p. 237]. 


* Gerhard Ringel and J. W. T. Youngs, Solution of the Heawood map-colouring problem, 
Proceedings of the National Academy of Sciences (U.S.A.), 1968. 


Four-dimensional geometry 


The idea of four-dimensional space has long been surrounded by an attrac- 
tive aura of mystery. The axiomatic approach (12.44) dispels the mystery 
without reducing the fascination. Having become accustomed to non- 
Euclidean geometries, we are no longer disconcerted by the possibility that 
two planes may have a common point without having a common line. More 
simply, we may regard the points of Euclidean 4-space as having four Car- 
tesian coordinates instead of the usual two or three. Any two distinct points 
determine a line, the three vertices of a triangle determine a plane, and the 
four vertices of a tetrahedron determine a hyperplane, which is given by a sin- 
gle linear equation connecting the four coordinates. 

In §§ 22.1-22.3 we describe the four-dimensional analogues of the Platonic 
solids. We shall see that there are six of these regular polytopes. Each con- 
sists of a finite number of solid cells in distinct hyperplanes, so arranged that 
every face of each cell belongs also to another cell. All these regular 
polytopes were discovered by Schläfli before 1855. 

Just as we can make flat pictures of solids by projecting them orthogo- 
nally onto a plane, so we can make flat or solid “pictures” of hypersolids by 
projecting them either onto a plane or onto a hyperplane. Instances of the 
former procedure are shown in Figures 22.1a, b and 22.3b; for an example 
of the latter, see Plate III on page 404. 

In § 22.4 we consider certain honeycombs (or “solid tessellations,” or “de- 
generate polytopes”) consisting of infinitely many solid cells in the same 3- 
space. In § 22.5 we see how these ideas help to explain some experimental 
results on the packing of equal spheres. 

The geometry of this chapter is Euclidean. But all the other kinds of ge- 
ometry can similarly be extended to spaces of any number of dimensions. 
As L. Fejes Tóth remarks in one of his books, we are able “to create an 
infinite set of new universes, the laws of which are within our reach, though 
we can never set foot in them.” 


22.1 THE SIMPLEST FOUR-DIMENSIONAL FIGURES 


That ye... may be able to comprehend with all saints what is the 
breadth, and length, and depth, and height. 


Ephesians III, 17-18 


Spirits have four dimensions. 


Henry More (1614-1687) 


If an inhabitant of flatland was able to move in three dimensions, he 
would be credited with supernatural powers by those who were unable 
so to move; for he could appear or disappear at will, could (so far as 
they could tell) create matter or destroy it. . . . We may go one step 
lower, and conceive of a world of one dimension—like a long tube— 
in which the inhabitants could move only forwards and backwards. ... 
Life in line-land would seem somewhat dull... . An inhabitant could 
know only two other individuals; namely, his neighbours, one on each 
side. 


W. W. Rouse Ball 
(Mathematical Recreations and Essays, 9th edition, 1920, p. 426) 


When trying to appreciate the idea of Euclidean 4-space, we are helped by 
imagining the efforts of a hypothetical two-dimensional being to visualize a 
three-dimensional world.* In solid geometry we can find a line (“the third 
dimension”) which is perpendicular to both of two intersecting lines and 
consequently perpendicular to every line in their plane. Analogously, in 4-space 
we can find a line (“the fourth dimension”) which is perpendicular to all three 
edges of a trihedral solid angle, such as a corner of a cube, and consequently 
perpendicular to every line in the 3-space that contains the solid angle. It 
follows that two 3-spaces that have a common point have a common plane, 
and the product of reflections in them is a rotation about this plane, analo- 
gous to the familiar rotation about a line in three dimensions or about a point 
in two dimensions. 

After accepting the idea of a fourth dimension, we can soon imagine a 
pyramid or a prism whose “base” is a solid. For instance, a regular tetra- 
hedron ABCD may serve as the base of a pyramid ABCDE (Figure 22.1a) 
whose apex E is along the fourth dimension through the center of ABCD. 
If E is so chosen that its distances from A, B, C, D are all equal to the edge 
AB, we have a regular simplex, which may be regarded in five ways as a 
pyramid, each vertex in turn serving as the apex while the remaining four 
form the base. 
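
The position of the apex E is easily computed in coordinates. In the Python sketch below (an illustration with an assumed, convenient choice of coordinates), the base ABCD consists of four alternate vertices of a cube of edge 2 lying in the hyperplane x₄ = 0, and E is placed on the fourth axis at the height h for which EA equals the edge AB.

```python
from itertools import combinations
from math import dist, isclose, sqrt

# Regular tetrahedron ABCD of edge 2*sqrt(2), centered at the origin,
# in the hyperplane x4 = 0 (assumed coordinates).
base = [(1, 1, 1, 0), (1, -1, -1, 0), (-1, 1, -1, 0), (-1, -1, 1, 0)]
edge = dist(base[0], base[1])

# Apex on the fourth axis through the center: EA^2 = 3 + h^2 = edge^2.
h = sqrt(edge ** 2 - 3)
simplex = base + [(0, 0, 0, h)]

distances = [dist(P, Q) for P, Q in combinations(simplex, 2)]
all_equal = all(isclose(d, edge) for d in distances)
print(len(distances), all_equal)
```

All ten distances agree, which is why any one of the five vertices may serve as the apex.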

* Flatland: A Romance of Many Dimensions, by A Square (E. A. Abbott), Boston, 1885 and 1928. 

Figure 22.1a        Figure 22.1b 

Figure 22.1b is merely an octagon with a square drawn inwards on each 
side, or an octagon and an octagram with corresponding sides joined by 
squares. It may be regarded as a picture of a hypercube (or “8-cell,” or 
“tesseract,” or “measure polytope”) which is a prism whose base is a cube, the 
“height” of the prism being equal to the edge of the base. Just as a cube 
can be traced out by moving a square along the third dimension, so a 
hypercube can be traced out by moving a cube along the fourth dimension. In 
Figure 22.1b the initial and final positions of the moving cube have been 
drawn in heavy lines. There are altogether eight cubes: these two, and six 
others traced out by the six faces. Each of the 24 squares (which appear in 
the figure as either squares or rhombi) belongs to two of the cubes, not lying 
in the same 3-space but rotated about the plane of the square until the two 
3-spaces are at right angles. 
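
These counts can be verified by listing the elements of the unit hypercube. In the Python sketch below (ours, with hypothetical names), an element is recorded as a 4-tuple in which each coordinate is either fixed at 0 or 1 or left free (marked '*'); an element with k free coordinates is k-dimensional.

```python
from itertools import combinations, product

def cube_elements(n, k):
    """All k-dimensional elements of the unit n-cube."""
    elements = []
    for free in combinations(range(n), k):
        fixed_axes = [i for i in range(n) if i not in free]
        for values in product((0, 1), repeat=n - k):
            cell = ['*'] * n
            for axis, value in zip(fixed_axes, values):
                cell[axis] = value
            elements.append(tuple(cell))
    return elements

def contains(big, small):
    """True if the element `small` lies in the element `big`."""
    return all(b == '*' or b == s for b, s in zip(big, small))

N0, N1, N2, N3 = (len(cube_elements(4, k)) for k in range(4))
print(N0, N1, N2, N3)          # vertices, edges, squares, cubes

squares, cubes = cube_elements(4, 2), cube_elements(4, 3)
cubes_per_square = {sum(contains(c, s) for c in cubes) for s in squares}
print(cubes_per_square)
```

The second line of output confirms that each square belongs to exactly two of the eight cubes.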
The regular simplex and the hypercube are the two simplest instances 


{3, 3, 3}, {4, 3, 3} 


of a regular polytope {p, q, r}, which is a configuration of equal Platonic solids 
{p, q}, called cells, fitting together in such a way that each face {p} belongs to 
two cells, and each edge to r cells. It follows that the arrangement of the 
cells at a vertex corresponds to the arrangement of the faces of a {q, r}, in 
the sense that each face of the {q, r} is a vertex figure of the corresponding 
cell. This {q, r}, whose vertices are the midpoints of the edges at one 
vertex of {p, q, r}, is naturally called the vertex figure of the polytope. In fact, 
the three-digit Schläfli symbol {p, q, r} is derived by “telescoping” the 
two-digit symbols {p, q} and {q, r} which denote the cell and the vertex figure. 

We can now complete the first two rows of Table IV (on p. 414), in which 
the numbers of vertices, edges, faces, and cells are denoted by N₀, N₁, N₂, N₃. 
Although there is no easy formula for any of these numerical properties as a 
function of p, q, r, we can readily find their mutual ratios by arguments 
analogous to those that led to 10.31. In fact, if V, E, F refer to the cell {p, q}, and 
V′, E′, F′ to the vertex figure {q, r}, we have 

    $F N_3 = 2N_2, \quad V N_3 = F' N_0, \quad V' N_0 = 2N_1, \quad E N_3 = r N_1 = p N_2 = E' N_0.$

For instance, the first equation comes from the observation that the F faces 
of the N₃ cells are just the N₂ faces, counted twice because each belongs to 
two cells. 
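
As a check, these relations can be tested on the regular simplex {3, 3, 3} and the hypercube {4, 3, 3}. The Python sketch below is ours: the element counts 5, 10, 10, 5 and 16, 32, 24, 8 are supplied as known data rather than derived, and the small table `PLATONIC` lists (V, E, F) only for the two solids needed.

```python
PLATONIC = {(3, 3): (4, 6, 4), (4, 3): (8, 12, 6)}   # {p, q}: (V, E, F)

def check_relations(p, q, r, counts):
    """Test the four incidence relations for {p, q, r}."""
    N0, N1, N2, N3 = counts
    V, E, F = PLATONIC[(p, q)]       # the cell {p, q}
    V1, E1, F1 = PLATONIC[(q, r)]    # the vertex figure {q, r}
    return (F * N3 == 2 * N2
            and V * N3 == F1 * N0
            and V1 * N0 == 2 * N1
            and E * N3 == r * N1 == p * N2 == E1 * N0)

print(check_relations(3, 3, 3, (5, 10, 10, 5)))
print(check_relations(4, 3, 3, (16, 32, 24, 8)))
```

Each relation counts one kind of incidence in two ways; for the hypercube, for example, the last chain reads 96 = 96 = 96 = 96.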


EXERCISES 
1. The numbers N₀ : N₁ : N₂ : N₃ are proportional to 

    $\dfrac{1}{q} + \dfrac{1}{r} - \dfrac{1}{2} \;:\; \dfrac{1}{r} \;:\; \dfrac{1}{p} \;:\; \dfrac{1}{p} + \dfrac{1}{q} - \dfrac{1}{2}.$

Thus {p, q, r} satisfies Schläfli’s four-dimensional analogue of Euler’s theorem: 
N₀ − N₁ + N₂ − N₃ = 0. 

2. A hypercube of edge 1, with one vertex at the origin and 4 edges along the 
Cartesian axes, has the 16 vertices (x₁, x₂, x₃, x₄), where each of the four x’s is either 0 or 
1, independently. 

3. A hypercube of edge 2, with its center at the origin and its edges parallel to the 
Cartesian axes, has the 16 vertices 

    (±1, ±1, ±1, ±1). 


4. Where is the center of the dilatation that relates the hypercubes described in the 
two preceding exercises? 


22.2 A NECESSARY CONDITION FOR THE EXISTENCE OF {p, q, r} 


“... Space... is spoken of as having three dimensions, which one may 
call Length, Breadth, and Thickness. . . . But some philosophical 
people have been asking why three dimensions particularly—why not 
another direction at right angles to the other three? . . . I do not mind 
telling you I have been at work upon this geometry of Four 
Dimensions for some time. . . .” 


H. G. Wells 
(The Time Machine, 1895, p. 5) 


It was apparently Kepler who first thought of the regular tessellations 
(§ 4.6) as infinite polyhedra. Analogously, the three-dimensional honey- 
comb of cubes (whose vertices may be taken to be all the points (x, y, z) 
for which x, y, z are integers) is the infinite polytope {4, 3, 4}: its cell is the 
cube {4, 3}, and its vertex figure is the octahedron {3, 4} whose eight faces 
are the vertex figures of the eight cubes that surround a vertex, one in each 
“octant.” The final 4 in the symbol {4, 3, 4} means that there are four cells 
surrounding an edge. These four cubes fit together without any interstices 
because the dihedral angle of the cube is exactly a right angle. On the other 
hand, the hypercube {4, 3, 3} is a finite polytope because the total angle at 
an edge is only three right angles, allowing the cells to be rotated out of 
the 3-space the way one derives a polyhedron by folding up its net (only now 
the angular deficiency is not related in any simple way to the number of 
vertices). 

Similarly, since the dihedral angle of the tetrahedron {3, 3} is slightly less 
than 71° (see Table II), we may place three, four, or five (but no more) 
tetrahedra together at a common edge, so as to begin the construction of 
{3, 3, 3}, {3, 3, 4}, or {3, 3, 5}. Again, since the dihedral angles of the 
octahedron and dodecahedron are between 90° and 120°, we may place just three 
of either together at an edge to obtain {3, 4, 3} and {5, 3, 3}. But the 
icosahedron cannot be used in this manner, as its dihedral angle is greater than 
120°. We have thus proved that the only possible finite regular polytopes 
in four dimensions are 


    {3, 3, 3},  {3, 3, 4},  {3, 3, 5},  {4, 3, 3},  {3, 4, 3},  {5, 3, 3}. 


The condition for {p, q, r} to be a finite polytope may be expressed in
general terms by recalling (from 10.43) that the dihedral angle of the
Platonic solid {p, q} is

\[ 2 \arcsin\left( \frac{\cos(\pi/q)}{\sin(\pi/p)} \right). \]

If r such angles together make less than 2π, each must be less than 2π/r.
Hence

\[ \arcsin\left( \frac{\cos(\pi/q)}{\sin(\pi/p)} \right) < \frac{\pi}{r}, \]

that is,

22.21    \[ \cos\frac{\pi}{q} < \sin\frac{\pi}{p}\,\sin\frac{\pi}{r}. \]

Similarly, the condition for {p, q, r} to be an infinite honeycomb filling
three-dimensional space is

22.22    \[ \cos\frac{\pi}{q} = \sin\frac{\pi}{p}\,\sin\frac{\pi}{r}, \]

an equation for which the only solution in integers greater than 2 is {4, 3, 4}.
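The two conditions 22.21 and 22.22 can be checked by brute force. The following is a minimal sketch, not part of the original argument: the search bound `LIMIT` and the floating-point tolerance `tol` are assumptions introduced here for illustration.

```python
import math

def classify(p, q, r, tol=1e-12):
    """Compare cos(pi/q) with sin(pi/p) * sin(pi/r), as in 22.21 and 22.22."""
    lhs = math.cos(math.pi / q)
    rhs = math.sin(math.pi / p) * math.sin(math.pi / r)
    if abs(lhs - rhs) <= tol:
        return "honeycomb"                      # equality: condition 22.22
    return "finite" if lhs < rhs else "neither"  # strict inequality: condition 22.21

# Assumed search bound: increasing p or r lowers the right side,
# and increasing q raises the left side, so nothing new appears beyond it.
LIMIT = 12
symbols = [(p, q, r)
           for p in range(3, LIMIT)
           for q in range(3, LIMIT)
           for r in range(3, LIMIT)]

finite = [s for s in symbols if classify(*s) == "finite"]
honeycombs = [s for s in symbols if classify(*s) == "honeycomb"]

print("finite polytopes:", finite)
print("honeycombs:", honeycombs)
```

The search returns the six symbols listed above for the strict inequality and only {4, 3, 4} for the equality.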
EXERCISES


1. The condition 22.21 implies both 10.33 and the analogous inequality with p
replaced by r. Hint:

\[ \sin\frac{\pi}{p}\,\sin\frac{\pi}{r} < \sin\frac{\pi}{p}. \]

2. Obtain the Schläfli symbol for the regular polytope whose eight vertices are

(±1, 0, 0, 0), (0, ±1, 0, 0), (0, 0, ±1, 0), (0, 0, 0, ±1),

that is, for the polytope

\[ |x_1| + |x_2| + |x_3| + |x_4| \le 1. \]

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22.3 CONSTRUCTIONS FOR REGULAR POLYTOPES 


Though analogy is often misleading, it is the least misleading thing 
we have. 
Samuel Butler (1835-1902)


(Music, Pictures, and Books) 


We have seen that the inequality 22.21 is a necessary condition for the
existence of a finite polytope {p, q, r}. The sufficiency of the condition
requires an actual construction for each of the six figures. We know that r
cells can fit together at an edge, but it is not obvious that the addition of 
further cells will ultimately yield a closed configuration in which every face 
of every cell belongs also to another cell. 

As the complete story of such constructions is very long [Coxeter 1, pp. 
145-153], we must be content with a brief sketch, aided by analogy with 
what happens in three dimensions. 

We recall that alternate vertices of a cube {4, 3} belong to an inscribed 
tetrahedron {3, 3} whose four faces correspond in an obvious manner to 
the four omitted vertices of the cube, whereas its six edges are diagonals of 
the six faces of the cube (one diagonal of each face). Moreover, the mid- 
points of these six edges, being the centers of the faces of the cube, are the 
vertices of an octahedron {3, 4}. 


Figure 22.3a 


Analogously, by selecting alternate vertices of the hypercube {4, 3, 3} we
obtain a polytope which has 8 vertices (the black points in Figure 22.3a)
and 16 cells: one tetrahedron (such as BCPQ) corresponding to each of the
8 omitted vertices, and another (such as ABPQ) inscribed in each of the 8
cubic cells. This “16-cell” has 24 edges, which are diagonals of the 24 square
faces of the hypercube (one diagonal of each face). Each of these 24 edges
belongs to 4 tetrahedra (2 of each type, occurring alternately); for example,
the edge PQ belongs to the 4 tetrahedra


ABPQ, BCPQ, CDPQ, DAPQ,
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the first and third of which are inscribed in two adjacent cubes whose com- 
mon face has PQ for one of its diagonals. We have thus proved that the 
16-cell is {3, 3, 4}. 
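The counts in this construction can be verified numerically. The sketch below assumes the hypercube has vertices (±1, ±1, ±1, ±1) and takes "alternate" vertices to be those with an even number of minus signs; the cells are found as sets of four mutually adjacent vertices.

```python
from itertools import combinations, product

# Alternate vertices of the hypercube: even number of minus signs (an assumed labelling).
vertices = [v for v in product((1, -1), repeat=4) if v.count(-1) % 2 == 0]

def dist2(u, v):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(u, v))

# Edges are diagonals of the square faces of the hypercube: squared length 8.
edges = [e for e in combinations(vertices, 2) if dist2(*e) == 8]
edge_set = set(edges)

def mutually_adjacent(points):
    """True when every pair of the given points is joined by an edge."""
    return all(pair in edge_set for pair in combinations(points, 2))

# Tetrahedral cells: four mutually adjacent vertices.
cells = [c for c in combinations(vertices, 4) if mutually_adjacent(c)]
cells_per_edge = {e: sum(1 for c in cells if e[0] in c and e[1] in c) for e in edges}

print(len(vertices), len(edges), len(cells), set(cells_per_edge.values()))
```

The output lists 8 vertices, 24 edges, and 16 cells, with every edge lying in exactly 4 tetrahedra, as the symbol {3, 3, 4} requires.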

Completing the third line of Table IV, we observe that the numerical prop- 
erties of {3, 3, 4} are just those of {4, 3, 3} in the reverse order. In fact, 
instead of obtaining the vertices of the 16-cell as alternate vertices of the 
hypercube, we could have obtained the vertices of another (similar) 16-cell 
as the centers of the cells of the hypercube. In other words, the hypercube 
and the 16-cell are reciprocal polytopes [Coxeter 1, p. 127], like the cube 
and the octahedron. More generally, the reciprocal of {p, q, r} is {r, q, p}.


The midpoints of the 24 edges of {3, 3, 4} are the 24 vertices of a poly- 
tope whose cells are 24 octahedra: the vertex figures at the 8 vertices of 
{3, 3, 4}, and 16 inscribed in the 16 tetrahedra. Since all its cells are octa- 
hedra {3, 4}, this “24-cell” is {3, 4, 3} [Hilbert and Cohn-Vossen 1, p. 152, 
Fig. 172]. 
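As a small numerical check of the midpoint construction, the following sketch assumes the 16-cell is scaled to have vertices (±2, 0, 0, 0) and permutations (the coordinates used in the exercises of this section), so that the midpoints have integer coordinates.

```python
from itertools import combinations

def dist2(u, v):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(u, v))

# 16-cell with vertices (±2, 0, 0, 0) and permutations (assumed scaling).
vertices16 = [tuple(s if j == i else 0 for j in range(4))
              for i in range(4) for s in (2, -2)]

# Edges join every pair of vertices except antipodal pairs: squared length 8.
edges16 = [(u, v) for u, v in combinations(vertices16, 2) if dist2(u, v) == 8]

# Midpoints of the 24 edges: the permutations of (±1, ±1, 0, 0).
midpoints = sorted({tuple((a + b) // 2 for a, b in zip(u, v)) for u, v in edges16})

# Edges of the resulting polytope join midpoints at the shortest distance.
edges24 = [(u, v) for u, v in combinations(midpoints, 2) if dist2(u, v) == 2]
neighbours = {m: sum(1 for e in edges24 if m in e) for m in midpoints}

print(len(edges16), len(midpoints), len(edges24), set(neighbours.values()))
```

The 24 midpoints each have 8 nearest neighbours, giving the 96 edges recorded for {3, 4, 3} in Table IV.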

By suitably dividing the 12 edges of an octahedron in the ratio τ : 1, we
obtain the 12 vertices of an icosahedron (see § 11.2). By dividing the 96
edges of the 24-cell {3, 4, 3} in this same ratio, we obtain the 96 vertices of
a semiregular polytope s{3, 4, 3} (the “snub 24-cell”), whose cells consist
of 24 icosahedra and 120 tetrahedra: namely, at each vertex of the 24-cell,
a set of 5 tetrahedra consisting of 1 surrounded by 4 others (like a partially
folded “net” for the regular simplex {3, 3, 3}). When each icosahedral cell
of s{3, 4, 3} is capped by an icosahedral pyramid (the way an icosahedron
is derived from a pentagonal antiprism by adding two pentagonal pyramids),
we obtain a new polytope having a cluster of 20 tetrahedra to replace each
of the 24 icosahedra, making a total of


24 · 20 + 120 = 600


tetrahedra. The 120 vertices of this polytope consist of the 96 vertices of 
s{3, 4, 3} and the 24 apices of the 24 icosahedral pyramids. (These 24 points, 
corresponding to the cells of the original {3, 4, 3}, are the vertices of a re- 
ciprocal {3, 4, 3}.) By careful examination [Coxeter 1, pp. 152-153] we 
find that every edge belongs to 5 of the 600 tetrahedra. Hence the 600-cell 
is {3, 3, 5} [Coxeter 1, frontispiece]. 
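The counts for the 600-cell can be confirmed numerically from the coordinates given in the exercises at the end of this section. This is only a sketch: the rounding to 9 decimals and the tolerance used to recognize the shortest distance are choices made here, and cells are identified as sets of four mutually adjacent vertices.

```python
import math
from itertools import combinations, permutations, product

TAU = (1 + math.sqrt(5)) / 2

def is_even(perm):
    """True when the permutation has an even number of inversions."""
    inversions = sum(1 for i, j in combinations(range(len(perm)), 2) if perm[i] > perm[j])
    return inversions % 2 == 0

def sign_changes(base):
    """All choices of sign for the nonzero coordinates of base."""
    return product(*[(x, -x) if x else (x,) for x in base])

# 96 vertices of s{3, 4, 3}: even permutations of (±tau, ±1, ±1/tau, 0).
points = set()
for perm in permutations(range(4)):
    if is_even(perm):
        for s in sign_changes((TAU, 1.0, 1 / TAU, 0.0)):
            points.add(tuple(round(s[i], 9) for i in perm))
snub_vertices = len(points)

# The 24 extra points: (±2, 0, 0, 0) permuted, and (±1, ±1, ±1, ±1).
for i in range(4):
    for c in (2.0, -2.0):
        points.add(tuple(c if j == i else 0.0 for j in range(4)))
points.update(product((1.0, -1.0), repeat=4))

pts = sorted(points)
n = len(pts)
d2 = {(i, j): sum((a - b) ** 2 for a, b in zip(pts[i], pts[j]))
      for i, j in combinations(range(n), 2)}
shortest = min(d2.values())

# Adjacency: pairs of vertices at the shortest distance.
adj = [set() for _ in range(n)]
for (i, j), d in d2.items():
    if d < shortest + 1e-6:
        adj[i].add(j)
        adj[j].add(i)
edges = sum(len(a) for a in adj) // 2

# Tetrahedra: sets of four mutually adjacent vertices i < j < k < m.
tetrahedra = 0
for i in range(n):
    for j in adj[i]:
        if j > i:
            common = {k for k in adj[i] & adj[j] if k > j}
            for k in common:
                tetrahedra += sum(1 for m in common & adj[k] if m > k)

print(snub_vertices, n, edges, tetrahedra)
```

The totals agree with the last line of Table IV: 120 vertices, 720 edges, and 600 tetrahedral cells.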

Finally, the 120-cell {5, 3, 3} (Figure 22.3b and Plate III) can be con- 
structed as the reciprocal of {3, 3, 5}: its 600 vertices are the centers of the 
600 tetrahedra. This information enables us to complete Table IV. 


It is interesting to record that the snub 24-cell s{3, 4, 3}, which plays such 
a useful role in the above construction for {3, 3, 5}, was discovered by 
Thorold Gosset in 1897.* Figure 22.3b was drawn by B. L. Chilton. 
Plate III is a photograph of a wire model made by P. S. Donchian. 


* Gosset was born in 1869 and died in 1962 [see Coxeter 1, pp. 162-164]. 


Figure 22.3b (Chilton’s drawing of {5, 3, 3})


Hartley [1, Nos. 56, 60] has given instructions for making models of a 
tetrahedron with an icosahedron placed on each face and of a dodecahe- 
dron with a dodecahedron placed on each face. When completed, these 


models show how we might begin to make solid nets for s{3, 4, 3} and 
{5, 3, 3}, respectively. 


EXERCISES


1. Locate the centers of the 8 cubic cells of the hypercube (±1, ±1, ±1, ±1).

2. Locate the midpoints of the 24 edges of the 16-cell

(±2, 0, 0, 0), (0, ±2, 0, 0), (0, 0, ±2, 0), (0, 0, 0, ±2).

3. Verify that the 96 vertices of s{3, 4, 3}, which are

(±τ, ±1, ±τ⁻¹, 0),

evenly permuted, divide the 96 edges of the 24-cell (±τ, ±τ, 0, 0) (permuted) in the
ratio τ : 1.

4. The 120 vertices of the 600-cell {3, 3, 5} are the 96 vertices of the above
polytope s{3, 4, 3}, along with the 24 extra points

(±2, 0, 0, 0) (permuted) and (±1, ±1, ±1, ±1).

5. The 600 vertices of the 120-cell {5, 3, 3} are the permutations of

(±2, ±2, 0, 0), (±√5, ±1, ±1, ±1),
(±τ, ±τ, ±τ, ±τ⁻²), (±τ², ±τ⁻¹, ±τ⁻¹, ±τ⁻¹),

along with the even permutations of

(±τ², ±τ⁻², ±1, 0), (±√5, ±τ⁻¹, ±τ, 0), (±2, ±1, ±τ, ±τ⁻¹).

(This corrects an error in the first edition of Coxeter 1, p. 157.)


PLATE III
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22.4 CLOSE PACKING OF EQUAL SPHERES 


As the foot presses upon the sand when the falling tide leaves it firm,
that portion of it immediately surrounding the foot becomes momentarily
dry.... The pressure of the foot causes dilatation of the sand,
and so more water is [drawn] through the interstices of the surrounding
sand..., leaving it dry until a sufficient supply has been obtained
from below, when it again becomes wet. On raising the foot we generally
see that the sand under and around it becomes wet for a little
time. This is because the sand contracts when the distorting forces
are removed, and the excess of water escapes at the surface.


Osborne Reynolds (1842 -1913) 
(British Association Report, Aberdeen, 1885, p. 897). 


Of all the two hundred thousand million men, women, and children 
who, from the beginning of the world, have ever walked on wet sand, 
how many, prior to the British Association Meeting at Aberdeen in 
1885, if asked, ‘Is the sand compressed under your foot?’ would have 
answered otherwise than ‘Yes!’? (Contrast with this the case of walk-
ing over a bed of wet sea-weed!) 


Lord Kelvin (1824-1907)
(Baltimore Lectures, 1904, p. 625) 


Figure 22.4a Figure 22.4b


Figures 22.4a and b show two possible ways of packing equal circles in a 
plane: the incircles of the faces of the regular tessellations {4,4} and {6, 3} 
(§ 4.6). It is intuitively obvious that the latter is the more “economical”
packing. To make this idea precise, we consider the incircles of the faces 
of the general regular tessellation {p, q}, and define the density of the pack- 
ing to be the ratio of the area of a circle to the area of the {p} in which it 
is inscribed. The density so defined is evidently less than 1, and the closest 
packing will have the greatest density, that is, the density nearest to 1. If
the face is a p-gon of side 2l, its inradius is r = l cot(π/p) and its area is plr
(see 2.91, 2.92); therefore, the density is

\[ \frac{\pi r^2}{plr} = \frac{\pi r}{pl} = \frac{\pi}{p}\cot\frac{\pi}{p} = \frac{\pi}{p} \Big/ \tan\frac{\pi}{p}. \]

This is an increasing function of p, and tends to 1 when p tends to infinity.
But since the p-gon is a face of a regular tessellation, the only relevant
values of p are 3, 4, 6. Therefore the “best” value of p is 6, and the closest
regular packing consists of the incircles of the faces of {6, 3}, the density
being

\[ \frac{\pi}{6}\cot\frac{\pi}{6} = \frac{\pi}{2\sqrt{3}} = 0.9069\ldots \]

[Hilbert and Cohn-Vossen 1, p. 47].


PLATE IV
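The density of the incircle packing can be tabulated for the three admissible values of p. A minimal sketch follows; the function name `incircle_density` is introduced here for illustration.

```python
import math

def incircle_density(p):
    """Density (pi/p) / tan(pi/p) of the incircles of the p-gonal faces."""
    return (math.pi / p) / math.tan(math.pi / p)

# Only p = 3, 4, 6 occur as faces of regular tessellations of the plane.
densities = {p: incircle_density(p) for p in (3, 4, 6)}
for p, d in densities.items():
    print(p, round(d, 4))
```

For p = 4 the value is simply π/4, and for p = 6 it agrees with the figure 0.9069... quoted above.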
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It can easily be proved that this is still the closest packing when we aban- 
don the requirement of regularity but insist instead that the centers of the 
circles form a lattice [Hilbert and Cohn-Vossen 1, pp. 33-35]. Actually, 
even this restriction can be abandoned [Darwin 1, p. 345; Fejes Toth 1, p. 
58], as the bees discovered millions of years ago (Plate IV). 


An analogous packing of spheres in three-dimensional space may be ob- 
tained by taking the inspheres of the cells of a honeycomb of equal poly- 
hedra. The density is naturally defined as the ratio of the volume of a sphere 
to the volume of the cell in which it is inscribed. In the case of {4, 3, 4}, 
the honeycomb of cubes of edge 2l, this is

\[ \frac{\tfrac{4}{3}\pi l^3}{(2l)^3} = \frac{\pi}{6} = 0.5236\ldots \]



A greater density can be obtained by using the midspheres (§ 10.4) of alter- 
nate cells, as we shall soon see. 


Figure 22.4c


If we imagine the cells of the cubic honeycomb to be colored alternately 
black and white, like a three-dimensional chessboard, we may dissect each 
white cube into six square pyramids (by planes joining pairs of opposite
edges) and attach each pyramid to the neighboring black cube. Each black 
cube is now covered with six white pyramids, one on each face, to form a 
rhombic dodecahedron (Figure 22.4c), whose twelve rhombic faces have the 
twelve edges of the black cube for their shorter diagonals [Steinhaus 2, p. 
152]. Thus the insphere of the rhombic dodecahedron is the midsphere of
the cube, of radius √2 l, and the volume of the rhombic dodecahedron is
twice that of the cube, namely, 2(2l)³ = 16l³. In the honeycomb of such
larger cells, each insphere is the midsphere of a black cube, and such spheres 
touch one another at the centers of the rhombic faces, that is, at the mid- 
points of the edges of the original honeycomb of cubes. Thus each sphere 
touches twelve others, the points of contact being the midpoints of the twelve 
edges of a cube. The density of this cubic close packing is evidently

\[ \frac{\tfrac{4}{3}\pi(\sqrt{2}\,l)^3}{16l^3} = \frac{\pi}{3\sqrt{2}} = 0.74048\ldots \]


[Hilbert and Cohn-Vossen 1, p. 47]. 
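The two densities, and the fact that each sphere touches twelve others, can be checked with a few lines of arithmetic. In this sketch the half-edge l is set to 1, and the centers of the black cubes are represented (an assumed labelling) by integer points with even coordinate sum, in units of the cube edge.

```python
import math
from itertools import product

l = 1.0  # half the edge of a cube (any positive value gives the same densities)

# Inspheres of all cubes of edge 2l.
cubic_density = (4 / 3) * math.pi * l**3 / (2 * l) ** 3

# Midspheres of the black cubes: radius sqrt(2)*l, one per rhombic dodecahedron of volume 16 l^3.
close_packed_density = (4 / 3) * math.pi * (math.sqrt(2) * l) ** 3 / (16 * l**3)

# Centres of black cubes, in units of the cube edge: integer points with even coordinate sum.
# Neighbouring spheres touch when the squared distance between centres is 2.
touching = [d for d in product((-1, 0, 1), repeat=3)
            if sum(d) % 2 == 0 and sum(x * x for x in d) == 2]

print(round(cubic_density, 4), round(close_packed_density, 5), len(touching))
```

The twelve displacement vectors found are the permutations of (±1, ±1, 0), which point to the midpoints of the twelve edges of a cube.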

The rhombic dodecahedron occurs in nature as a crystal of garnet, and the 
three-dimensional chessboard occurs as the arrangement of atoms in a crys- 
tal of common salt, with a sodium atom in each black cube and a chlorine 
atom in each white cube (or vice versa). The centers of the black cubes, 
which are the centers of the spheres in cubic close packing, are easily seen 
to form the face-centered cubic lattice. It follows from § 18.4 that this is 
the densest possible packing of spheres whose centers form a lattice. 


In old war memorials we often see a pyramidal pile of cannon balls: one 
at the apex resting on four others which, in turn, rest on nine, and so on. 
Each interior ball touches 12 others: 4 in its own layer, 4 above, and 4 be- 
low. In fact, these cannon balls are arranged in cubic close packing [Kep- 
ler 1, pp. 268-269]. The base of the square pyramid consists of (say) n? 
balls arranged like the circles in Figure 22.4a. When n is large, the shape 
of the whole pyramid is essentially the “top” half of a regular octahedron
(regarded as a square dipyramid); each sloping face is an equilateral
triangle formed by 1 + 2 + ... + n balls.

By turning the pyramid over so that such a sloping face becomes hori- 
zontal, we obtain a different aspect of the same packing. In this aspect we 
begin with a horizontal layer of spheres whose “equators” are the incircles 
of the hexagons of {6, 3}, as in Figure 22.4b. The next higher layer is just
like this but shifted slightly to the right (say), so that each sphere rests on 
three, its center being vertically above a vertex of {6, 3} from which an edge 
goes off to the right. Since all the centers form a three-dimensional lattice, 
the spheres in the third layer (resting on the second) are shifted again to 
the right, so that each center is vertically above a vertex of {6, 3} from which 
an edge goes off to the left. The fourth layer is vertically above the first, 
and thereafter the sequence recurs. 

In 1883, the crystallographer Barlow described an equally dense packing 
in which the centers do not form a lattice. This can be derived by taking 
the same horizontal layers in a different order. More precisely, we discard 
the “third layer” just described and substitute a new third layer vertically
above the first. Then we add a fourth layer vertically over the second, and 
so on; the shifting from one layer to the next is alternately to the right and 
left, like a zigzag. This nonlattice packing is called hexagonal close packing 
[Ball 1, p. 150; Hilbert and Cohn-Vossen 1, p. 46; Steinhaus 2, p. 170; 
Fejes Tóth 1, pp. 172-173].


Helpful models to illustrate these ideas are provided by fourteen golf balls and two shallow
trays of dimensions 5 in. × 5 in. and 4.6 in. × 5.8 in., respectively. Either tray will hold nine
balls in three rows of three. In the square tray, a pyramidal “cannon ball” arrangement can be
completed by adding four more above and the remaining one at the top. In the oblong tray,
the four balls in the second layer should have their centers at the vertices of a rhombus, not a
rectangle. The third layer is again represented by just one ball, but now there are two possible
positions for it: one belongs to cubic close packing and the other to hexagonal close packing.

Since hexagonal close packing has the same density as cubic close pack- 
ing, namely 0.74048 ... , it is natural to ask whether some still less systematic 
packing (without any straight rows of spheres) may have a greater density. 
This remains an open question. The best theoretical approach to an an- 
swer is the proof by Rogers* that, if such a packing exists, its density must 
be less than 0.7797... . 

Experiments in this direction began as long ago as 1727, when Stephen 
Hales stated, in his Vegetable Staticks, 


I compressed several fresh parcels of Pease in the same Pot, with a force equal to 1600, 
800, and 400 pounds; in which Experiments, tho’ the Pease dilated, yet they did not 
raise the lever, because what they increased in bulk was, by the great incumbent weight, 
pressed into the interstices of the Pease, which they adequately filled up, being thereby 
formed into pretty regular Dodecahedrons. 


Hales presumably reached his conclusion by observing some pentagonal 
faces on his dilated peas. They could not all have been regular dodeca- 
hedra. For, since the dihedral angle of the regular dodecahedron is less 
than 120° (see Table II on p. 413), three such solids with a common edge 
will leave an angular gap of about 10° 19’. In fact, dodecahedra {5, 3} are 
the cells of the configuration {5, 3, 3}, which is not an infinite three-dimen- 
sional honeycomb but a finite four-dimensional polytope. 

In 1939, the botanists J. W. Marvin and E. B. Matzke repeated Hales’s 
experiment, replacing his peas by lead shot, “carefully selected under a mi- 
croscope for uniformity of size and shape,” in a steel cylinder, compressed 
with a steel plunger at a sufficient pressure (40,000 pounds) to eliminate all 
interstices.† When the shot were stacked in cannon-ball fashion and
compressed, they became nearly perfect rhombic dodecahedra. But “if the shot
were just poured into the cylinder the way Hales presumably put his peas 
into the iron pot, irregular 14-faced bodies were formed.” Almost all the 
faces were either quadrangles, pentagons, or hexagons, with pentagons pre- 
dominating. Another botanist examined cells in undifferentiated vegetable 
tissues, and concluded that the internal cells have an average of approxi- 
mately 14 faces, though the most prevalent shape (occurring 32 times among 
the 650 cells examined) had 13 faces: 3 quadrangles, 6 pentagons, and 4 
hexagons. The few cells that had only 12 faces were neither rhombic dode- 
cahedra nor regular dodecahedra. 

Matzke also made a microscopic examination of a froth of 1900 measured 
bubbles. “For 600 central bubbles examined, the average number of
contacts was 13.7.” The commonest shape had again 13 faces: 1 quadrangle,
10 pentagons, and 2 hexagons. 

*C. A. Rogers, The packing of equal spheres, Proceedings of the London Mathematical Society 
(3), 8 (1958), pp. 609-620. 


† E. B. Matzke, In the twinkling of an eye, Bulletin of the Torrey Botanical Club, 77 (1950),
pp. 222-227. 
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In 1959, Professor Bernal* confirmed the prevalence of pentagonal faces 
by a remarkably simple experiment in which equal balls of “Plasticene” (oily 
modeling clay) were rolled in powdered chalk, packed together irregularly, 
and pressed into one solid lump. The resulting polyhedra were found to 
have an average of 13.3 faces. 

To test the possibility that a random packing of equal spheres might attain 
a density between 0.7405 and 0.7797, G. D. Scott poured thousands of ball 
bearings into spherical flasks of various sizes, gently shaking each flask as 
it was being filled. Assuming that the exceptional situation at the surface
of the container will make the density

\[ \rho - \varepsilon N^{-1/3} \]

for N balls, where ρ and ε are constants, he found from these experiments a
closest random packing with

ρ = 0.6366, ε = 0.33.

By careful filling of the flasks without shaking, a loosest incompressible
random packing was found with

ρ = 0.60, ε = 0.37.

Since, for the closest random packing, ρ falls far short of 0.7405, it seems
unlikely that any greater density can be maintained throughout a region
that extends indefinitely in all directions.

If we could fill a spherical flask with N ball bearings in cubic close
packing, we would expect the density to be expressible as a series beginning
with the two terms

\[ 0.7405 - \varepsilon N^{-1/3}. \]

But this experiment does not seem to be feasible. A scholarly book has
been written on the theory of lattice points in spheres† without throwing
any light on the value of ε in this 3-dimensional case, although considerable
progress has been made on the analogous problem in spaces of other
numbers of dimensions, such as 2 or 4.

Whatever the closest random packing may be, it is clear from Osborne 
Reynolds’s experiment on the seashore that any small disturbance increases 
the size of the interstices. The same principle may explain a Hindu fakir’s 
magic trick, which was mentioned by Martin Gardner. A cylindrical jar 
with a rather narrow opening at the top is filled with uncooked rice, gently 
shaken down so as to be well packed. A table knife is plunged repeatedly 
into the jar, to a greater depth each time. After about a dozen plunges, 
the knife will suddenly bind so that, when raised by the handle, it will sup- 
port the whole jar of rice. 

* J. D. Bernal, A geometrical approach to the structure of liquids, Nature, 183 (1959), pp. 141-147.
† Arnold Walfisz, Gitterpunkte in mehrdimensionalen Kugeln, Warsaw, 1957.




EXERCISES 


1. Is the arrangement of incircles of all the faces of the tessellation {4, 4} (Figure 
22.4a) any less dense than that of the circumcircles of alternate faces, i.e., the circum- 
circles of the black squares of a chessboard? 

2. Is it possible to arrange seven equal non-overlapping spheres in such a way that 
two of them touch each other and both touch all the remaining five, while these five 
(a) form a ring in which each touches two others? (b) do not touch one another at all? 

3. Is it possible to arrange thirteen equal non-overlapping spheres in such a way 
that one of them touches all the remaining twelve while these twelve do not touch one 
another at all? 

4. A pyramidal pile with n layers contains n(n + 1)(2n + 1)/6 cannon balls [Ball
1, p. 59]; a tetrahedral pile contains n(n + 1)(n + 2)/6. In both cases the arrange- 
ment is cubic close packing. 
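The two counting formulas of Exercise 4 can be checked by adding up the layers directly. This is a small sketch; the function names are introduced here for illustration.

```python
def square_pile(n):
    """Balls in a square pyramidal pile: layers of 1, 4, 9, ..., n*n balls."""
    return sum(k * k for k in range(1, n + 1))

def tetrahedral_pile(n):
    """Balls in a tetrahedral pile: layers of 1, 3, 6, ..., n(n+1)/2 balls."""
    return sum(k * (k + 1) // 2 for k in range(1, n + 1))

for n in range(1, 6):
    print(n, square_pile(n), tetrahedral_pile(n))
```

For every n tried, the layer sums agree with n(n + 1)(2n + 1)/6 and n(n + 1)(n + 2)/6 respectively.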


22.5 A STATISTICAL HONEYCOMB 


The fluidity of a liquid is a consequence of its molecular irregularity. 


J. D. Bernal (1901 - ) 


Three equal circles in a plane are packed as closely as possible when they 
all touch one another. The two-dimensional problem of close packing is 
easy because any number of further circles can be added in such a way as 
to continue the pattern systematically over the whole plane. This is, as we 
have seen, the pattern formed by the incircles of the faces of the regular
tessellation of hexagons, {6, 3} (Figure 22.4b).

Analogously in space, four equal spheres are packed as closely as possible 
when they all touch one another, and some further spheres can be added so 
as to form the beginning of a pattern apparently consisting of the inspheres 
of the cells of a regular honeycomb {p, 3, 3}. Although the equation 22.22
has no integral solution when q = r = 3, we naturally conclude that a
compressed random packing of equal lead shot, a nearly homogeneous aggregate
of vegetable cells, and a froth of equal bubbles, are all somehow trying to ap- 
proximate to a honeycomb {p, 3, 3} in which p lies between 5 and 6. The 
fractional value of p means that this “honeycomb” can exist only in a statisti- 
cal sense, but the agreement with experiment is striking. 

When q = r = 3, the equation 22.22 actually becomes

22.51    \[ \sin\frac{\pi}{p} = \cot\frac{\pi}{3} = \frac{1}{\sqrt{3}}. \]

This shows that the angle π/p is 35° 15′ 52″, which is half the dihedral
angle of the regular tetrahedron {3, 3} (see Table II). (In fact, we may
regard p as the number of regular tetrahedra {3, 3} that can be placed together
around a common edge, as if we were beginning to construct the dual
honeycomb {3, 3, p} whose vertices are the centers of the spheres.) Thus

\[ p = \frac{180^\circ}{35^\circ\,15'\,52''} = 5.1043\ldots, \]

in agreement with Matzke’s observation that pentagons are prevalent
(especially in a froth) whereas hexagons are more frequent than quadrangles. The
cell {p, 3} has an average of F faces and V vertices where, by 10.32 with q = 3,

\[ F = \frac{12}{6 - p} = 13.398\ldots, \qquad V = \frac{4}{(6/p) - 1} = 22.79\ldots, \]

in reasonably close agreement with Matzke’s 13.7, with Bernal’s 13.3, and
with one of the two theoretical models proposed by Meijering,* who used
intricate statistical methods to obtain V = 22.56.... A fourth theoretical
model [Coxeter 4, p. 756] yields

\[ F = \tfrac{1}{3}(23 + \sqrt{313}) = 13.564\ldots, \qquad V = \tfrac{2}{3}(17 + \sqrt{313}) = 23.128\ldots \]
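The numerical values in this section follow from a few lines of floating-point arithmetic. The sketch below simply evaluates the formulas above; the variable names are introduced here, and the last printed digit of each result may differ slightly from the rounded figures quoted in the text.

```python
import math

# Fractional p from 22.51: sin(pi/p) = 1/sqrt(3).
half_dihedral = math.asin(1 / math.sqrt(3))   # about 35 degrees 16 minutes, in radians
p = math.pi / half_dihedral

# Average numbers of faces and vertices of the "cell" {p, 3}.
F = 12 / (6 - p)
V = 4 / (6 / p - 1)

# The other theoretical model quoted from [Coxeter 4, p. 756].
F_alt = (23 + math.sqrt(313)) / 3
V_alt = 2 * (17 + math.sqrt(313)) / 3

print(round(math.degrees(half_dihedral), 4), round(p, 4))
print(round(F, 3), round(V, 3))
print(round(F_alt, 3), round(V_alt, 3))
```

In both models the values satisfy V = 2F − 4, as they must for a polyhedron with three faces at each vertex.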


EXERCISE

In the “twisted prism” formed by 28 regular tetrahedra

A₀A₁A₂A₃, A₁A₂A₃A₄, ..., A₂₇A₂₈A₂₉A₃₀,

the broken line A₀A₃A₆A₉ ... A₃₀ consists of 10 equal chords of a circular helix. Taking
the axis of this helix to be vertical, do we find the vertex A₃₀ exactly above A₀? (A
model can be conveniently made by fastening together 87 equal sticks from the
“D-stix Pre-engineering Kit 701,” manufactured in Yardley, Wash.) For the whole story,
we regard the tetrahedra as being 28 cells of the “honeycomb” {3, 3, p}, where p is
given by 22.51. (A₀A₁A₂ ... is a “Petrie polygon” of this honeycomb.) Setting

\[ \cos^2\frac{\pi}{p} = \frac{2}{3} \]

and q = r = 3 in the equation 12.35 of Coxeter [1, p. 221], we obtain ξ₁ = 0 and
cos ξ₂ = −⅔. The angle between the planes joining the axis to A₀ and A₃₀ is

30(ξ₂ − 120°) = 354° 20′.

(This ξ₂, being nearly 131° 49′, is remarkably close to the corresponding property of
the four-dimensional polytope {3, 3, 5}, which is exactly 132° [Coxeter 1, p. 247].)
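The angles in this exercise can be evaluated directly from cos ξ₂ = −⅔. The following sketch does only that arithmetic; the names `xi2` and `twist` are introduced here.

```python
import math

# Angle xi_2 with cos(xi_2) = -2/3, in degrees.
xi2 = math.degrees(math.acos(-2 / 3))

# Total angle between the planes joining the axis to A0 and A30.
twist = 30 * (xi2 - 120)

def degrees_minutes(angle):
    """Split an angle in degrees into whole degrees and (rounded) minutes."""
    whole = int(angle)
    return whole, round((angle - whole) * 60)

print(degrees_minutes(xi2), degrees_minutes(twist))
```

Since the total falls a few degrees short of 360°, the vertex A₃₀ is close to, but not exactly above, A₀.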


*J. L. Meijering, Philips Research Reports, 8 (1953), p. 282. The value V = 22.79 ... was 
first obtained by C. S. Smith, Acta Metallurgica, 1 (1953), p. 299. See also E. N. Gilbert, Annals 
of Mathematical Statistics, 33 (1962), pp. 958-972, and R. E. Williams, Science, 161 (1968), pp.
276-277.
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Table I 


The 17 Space Groups of Two-Dimensional Crystallography (§ 4.3)

Symbol   Generators
p1       Two translations
p2       Three half-turns
pm       Two reflections and a translation
pg       Two parallel glide reflections
cm       A reflection and a parallel glide reflection
pmm      Reflections in the four sides of a rectangle
pmg      A reflection and two half-turns
pgg      Two perpendicular glide reflections
cmm      Two perpendicular reflections and a half-turn
p4       A half-turn and a quarter-turn
p4m      Reflections in the three sides of a (45°, 45°, 90°) triangle
p4g      A reflection and a quarter-turn
p3       Two rotations through 120°
p3m1     A reflection and a rotation through 120°
p31m     Reflections in the three sides of an equilateral triangle
p6       A half-turn and a rotation through 120°
p6m      Reflections in the three sides of a (30°, 60°, 90°) triangle

Table II
The Five Platonic Solids (§ 10.3)

Name           Schläfli Symbol   V    E    F    Dihedral Angle
Tetrahedron    {3, 3}            4    6    4    70° 32′ −
Cube           {4, 3}            8    12   6    90°
Octahedron     {3, 4}            6    12   8    109° 28′ +
Dodecahedron   {5, 3}            20   30   12   116° 34′ −
Icosahedron    {3, 5}            12   30   20   138° 11′ +
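
The dihedral angles in Table II can be recomputed from the formula 2 arcsin(cos(π/q)/sin(π/p)) quoted in § 22.2. A minimal sketch; the function name and the dictionary of solids are introduced here for illustration.

```python
import math

def dihedral_degrees(p, q):
    """Dihedral angle of the Platonic solid {p, q}, in degrees."""
    return math.degrees(2 * math.asin(math.cos(math.pi / q) / math.sin(math.pi / p)))

solids = {
    "Tetrahedron": (3, 3),
    "Cube": (4, 3),
    "Octahedron": (3, 4),
    "Dodecahedron": (5, 3),
    "Icosahedron": (3, 5),
}
angles = {name: dihedral_degrees(p, q) for name, (p, q) in solids.items()}
for name, angle in angles.items():
    print(f"{name:13s} {angle:9.4f}")
```

The signs after the tabulated angles indicate whether the exact value is slightly less or slightly more than the value printed to the nearest minute.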

Table III
The Finite Groups of Isometries (§ 15.5)

              Rotation Groups    Direct Products      Mixed Groups
Name          Symbol   Order     Symbol      Order    Symbol    Order

Cyclic        Cₙ       n         Cₙ × {I}    2n       C₂ₙCₙ     2n
Dihedral      Dₙ       2n        Dₙ × {I}    4n       DₙCₙ      2n
                                                      D₂ₙDₙ     4n
Tetrahedral   A₄       12        A₄ × {I}    24
Octahedral    S₄       24        S₄ × {I}    48       S₄A₄      24
Icosahedral   A₅       60        A₅ × {I}    120
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The Regular Polytopes {p, q, r} (§ 22.2) 
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Table IV 


Name              Schläfli Symbol   N₀    N₁     N₂     N₃
Regular simplex   {3, 3, 3}         5     10     10     5
Hypercube         {4, 3, 3}         16    32     24     8
16-cell           {3, 3, 4}         8     24     32     16
24-cell           {3, 4, 3}         24    96     96     24
120-cell          {5, 3, 3}         600   1200   720    120
600-cell          {3, 3, 5}         120   720    1200   600
Cubic honeycomb   {4, 3, 4}         ∞     ∞      ∞      ∞
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Answers to Exercises 


§1.3 

1. The reflection in the line x = y interchanges x and y.

2. If the line BA meets the circle in P (beyond A) and P′ (between A and B), we have
BC² = BP × BP′ = (BA + AC)(BA − AC).

3. The triangle CDF is equilateral; so is ABC. 

4. This result was conjectured by Paul Erdős and first proved by L. J. Mordell
(see the American Mathematical Monthly, 44 (1937), p. 252, problem 3740, or Fejes
Tóth 1, pp. 12-14). In 1960, Mordell discovered the following simpler proof. For
convenience, let OA, OB, OC, OP, OQ, OR be denoted by x, y, z, p, q, r, so that the
theorem to be proved is

\[ x + y + z \ge 2(p + q + r). \]

Also let p′, q′, r′ denote the lengths (within the triangle) of the bisectors of the angles

\[ 2\alpha = \angle BOC, \quad 2\beta = \angle COA, \quad 2\gamma = \angle AOB. \]

By comparing the area of the triangle OBC with the two parts into which it is dissected
by the bisector p′, and using the well-known inequality y + z ≥ 2√(yz) which comes
from (√y − √z)² ≥ 0, we find

\[ yz \sin 2\alpha = p'(y + z)\sin\alpha \ge 2p'\sqrt{yz}\,\sin\alpha, \]

with equality only when y = z. Hence √(yz) cos α ≥ p′, and similarly √(zx) cos β ≥ q′,
√(xy) cos γ ≥ r′. Since

\[ x + y + z - 2\sqrt{yz}\cos\alpha - 2\sqrt{zx}\cos\beta - 2\sqrt{xy}\cos\gamma
   = (\sqrt{x} - \sqrt{y}\cos\gamma - \sqrt{z}\cos\beta)^2 + (\sqrt{y}\sin\gamma - \sqrt{z}\sin\beta)^2 \ge 0, \]

it follows that

\[ x + y + z \ge 2(p' + q' + r') \ge 2(p + q + r). \]

The inequality x + y + z ≥ 2(p′ + q′ + r′), which may be regarded as an extended
form of the Erdős-Mordell theorem, was first noticed by D. F. Barrow. See also
O. Bottema, R. Ž. Djordjević, R. R. Janić, D. S. Mitrinović and P. M. Vasić, Geometric
Inequalities (Wolters-Noordhoff, Groningen, The Netherlands, 1969), p. 139.

5. Equality occurs only if x = y = z and √y sin γ − √z sin β = 0, etc., so that
α = β = γ. The triangle is equilateral, and O is its center.
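
The Erdős-Mordell inequality of answers 4 and 5 lends itself to a numerical spot check. The sketch below samples random interior points of one arbitrarily chosen triangle (the coordinates and the random seed are assumptions made here) and also evaluates the equality case at the center of an equilateral triangle.

```python
import math
import random

def distance_to_line(P, A, B):
    """Perpendicular distance from the point P to the line AB."""
    cross = (B[0] - A[0]) * (A[1] - P[1]) - (A[0] - P[0]) * (B[1] - A[1])
    return abs(cross) / math.dist(A, B)

def erdos_mordell_gap(A, B, C, O):
    """(x + y + z) - 2(p + q + r) for a point O inside the triangle ABC."""
    x, y, z = (math.dist(O, V) for V in (A, B, C))
    p = distance_to_line(O, B, C)
    q = distance_to_line(O, C, A)
    r = distance_to_line(O, A, B)
    return (x + y + z) - 2 * (p + q + r)

def random_interior_point(A, B, C, rng):
    """A uniformly random point of the triangle ABC."""
    u, v = rng.random(), rng.random()
    if u + v > 1:
        u, v = 1 - u, 1 - v
    return (A[0] + u * (B[0] - A[0]) + v * (C[0] - A[0]),
            A[1] + u * (B[1] - A[1]) + v * (C[1] - A[1]))

rng = random.Random(0)
A, B, C = (0.0, 0.0), (4.0, 0.0), (1.0, 3.0)   # an arbitrary sample triangle
smallest_gap = min(erdos_mordell_gap(A, B, C, random_interior_point(A, B, C, rng))
                   for _ in range(10000))

# Equality case: the centre of an equilateral triangle.
E = ((0.0, 0.0), (1.0, 0.0), (0.5, math.sqrt(3) / 2))
equilateral_gap = erdos_mordell_gap(*E, (0.5, math.sqrt(3) / 6))

print(smallest_gap >= 0, abs(equilateral_gap) < 1e-12)
```

A check of this kind proves nothing, but it is a quick way to catch a mis-stated inequality.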

6. Let h_a denote the “altitude” from A to BC, and Δ the area of the triangle ABC.
Since x + p ≥ h_a, we have

\[ a(x + p) \ge a h_a = 2\Delta = ap + bq + cr, \]

whence

\[ ax \ge bq + cr. \]

Consideration of similar triangles shows that this remains true when the line AO is
extended (even if O is on the far side of the base BC). Applying the same inequality to
the image of O by reflection in the internal bisector of ∠BAC, we obtain

\[ ax \ge br + cq. \]

Adding these two inequalities for ax, we obtain

\[ 2ax \ge (b + c)(q + r). \]

Multiplying together this and two other inequalities of the same kind,

\[ 8abc\,xyz \ge (b + c)(c + a)(a + b)(q + r)(r + p)(p + q). \]

Since b + c ≥ 2√(bc), etc., we have (b + c)(c + a)(a + b) ≥ 8abc. Hence

\[ xyz \ge (q + r)(r + p)(p + q). \]


Figure 1.3e


7. Let B be the smaller of the two different angles B and C of the triangle ABC.
Let BY and CZ be the internal bisectors of these angles, as in Figure 1.3e. Take U on
AZ so that ∠ZCU = ½B. Since the triangle UBC has a smaller angle at B than at C,
BU > CU. Take V on BU so that BV = CU. Take W on BY so that ∠BVW =
∠CUZ. By the angle-side-angle criterion, BVW and CUZ are congruent triangles,
and BW = CZ. But W and Y are on opposite sides of the line CU. Hence BY > BW,
that is, BY > CZ.

This theorem was proposed in 1840 by C. L. Lehmus, and proved by Jacob Steiner. 
For its history, see J. A. McBride, Edinburgh Mathematical Notes, 33 (1943), pp. 1-13. 
McBride asserts that more than sixty proofs have been given. The simple one given 
above came in a letter from H. G. Forder; it excels most by being ‘absolute’ in the 
sense of § 12.1. For the original proofs by Steiner and Lehmus, respectively, see Journal
für die reine und angewandte Mathematik, 28 (1844), p. 376, and Archiv der Mathematik
und Physik, 15 (1850), p. 225. 
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§1.4 
1. Apply Euclid I.5 to the isosceles triangle GBC and then I.4 to the two triangles 
B’BC, C'CB. 
2. Add together three inequalities such as

⅔BB′ + ⅔CC′ > BC.

Complete the parallelogram CABK and observe that twice the median from A is

AK < AC + CK = b + c.


§1.5 
1. The circle through P with center O.

2. Use 1.52.

3. Let the tangents to the incircle from A, B, C be t_a, t_b, t_c. Then

t_b + t_c = a, t_c + t_a = b, t_a + t_b = c;

therefore

t_a = ½(b + c − a) = s − a.

4. By Euclid III.20, if the angle at the circumference is greater than 90°, the angle 
at the center is greater than 180°. 

5. At the midpoint of the hypotenuse. 

6. Make repeated use of Pythagoras’s theorem. 

7. Since bc + ca + ab = {(s − a)(s − b)(s − c) + abc + s³}/s = r² + 4Rr + s²
and abc = 4RΔ = 4Rrs, we have

\[ (2R - a)(2R - b)(2R - c) = 8R^3 - 8R^2 s + 2R(r^2 + 4Rr + s^2) - 4Rrs = 2R(2R + r - s)^2. \]

Alternatively, in the notation of ex. 3,

\[ (t_a - r)(t_b - r)(t_c - r) = (s - r - a)(s - r - b)(s - r - c)
   = (s - r)^3 - 2s(s - r)^2 + (r^2 + 4Rr + s^2)(s - r) - 4Rrs
   = 2r^2(s - r - 2R). \]

This yields the desired criterion, since the angle A is acute or right or obtuse according
as t_a − r is positive or zero or negative. In fact, we can conclude further that the
triangle has an obtuse angle if and only if r + 2R > s. Corrado Ciamberlini [Bollettino
della Unione Matematica Italiana (2), 5 (1943), pp. 37-41] observed that

\[ 4R^2 \cos A \cos B \cos C = s^2 - (r + 2R)^2. \]


8. Since 27, =e + &e& + 6 + 4g + = Ade. Also Desi; = Lee; + i) — 
Le? = $(Le,)? - Le? = 

9. If $\epsilon_1 = \epsilon_2 = 0$, Beecroft's equations imply $\eta_3 = \eta_4 = 0$ and $\epsilon_3 = \epsilon_4 = \eta_1 = \eta_2$,
so that the configuration consists of four lines $E_1$, $H_3$, $E_2$, $H_4$ forming a square, and
four circles $H_1$, $E_3$, $H_2$, $E_4$ having the sides of the square for diameters. This is what
happens when $k + l = m + n = 0$. In any other case, we can assign arbitrary values
to the three bends $\epsilon_1$, $\epsilon_2$, $\eta_3$, subject only to the condition $\epsilon_1 + \epsilon_2 > 0$ (which ensures
that if $E_1$ and $E_2$ have internal contact, the larger circle is the one whose bend is
negative). Then $\eta_4$ is determined by the simple equation
$$\epsilon_1 + \epsilon_2 = \eta_3 + \eta_4$$
(which follows from $\epsilon_1 + \epsilon_2 - \epsilon_3 + \epsilon_4 = 2\eta_3$, $\epsilon_1 + \epsilon_2 + \epsilon_3 - \epsilon_4 = 2\eta_4$), and the
remaining bends are
Me = €1e Us’ — 1&2 ii €” = gM, ; ey? = Nga 
€, = ———_, 6 SS Say = ——_ , > = ——————— 
2 eet & a €; + & t €, + & he €y + & 
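The proposed parametrization is obtained by choosing
$$k = \epsilon_1/\sqrt{\epsilon_1 + \epsilon_2}, \quad l = \epsilon_2/\sqrt{\epsilon_1 + \epsilon_2},$$
$$m = -\eta_3/\sqrt{\epsilon_1 + \epsilon_2}, \quad n = -\eta_4/\sqrt{\epsilon_1 + \epsilon_2}.$$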
10. Using 1.59, 1.58, 1.52, and 1.56 in turn, we obtain
$$\epsilon_4 = \epsilon_1 + \epsilon_2 + \epsilon_3 \pm 2\eta_4 = \frac{1}{s-a} + \frac{1}{s-b} + \frac{1}{s-c} \pm \frac{2}{r}$$
$$= \frac{r_a + r_b + r_c \pm 2s}{\Delta} = \frac{r + 4R \pm 2s}{\Delta}.$$
These two circles (which touch three mutually tangent circles) became known as
Soddy's circles before anyone noticed the earlier work of Descartes and Steiner
[1, pp. 60-63, 524]. As Descartes used the letters d, e, f, x for the radii of four mutually
tangent circles, he undoubtedly saw that he had obtained a quadratic equation for x
in terms of d, e, f.
11. Let $A_1$, $B_1$, $C_1$ be the feet of the perpendiculars from P to BC, CA, AB. Applying
Euclid III.21 or 22 to each of the cyclic quadrangles $PA_1B_1C$, $PABC$, $PA_1BC_1$, show that
$\angle PA_1B_1$ and $\angle PA_1C_1$ are either equal or supplementary.
12. $\angle C_3B_3A_3 = \angle C_3B_3P + \angle PB_3A_3 = \angle CBP + \angle PBA = \angle CBA$.
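
The two factorizations used in the answer to ex. 7 can be spot-checked numerically. The following is a minimal Python sketch, not part of the original text: the function names are illustrative, and r and R are obtained from Heron's formula rather than from any construction in the book.

```python
import math

def triangle_invariants(a, b, c):
    """Return (s, r, R): semiperimeter, inradius, circumradius."""
    s = (a + b + c) / 2
    area = math.sqrt(s * (s - a) * (s - b) * (s - c))  # Heron's formula
    return s, area / s, a * b * c / (4 * area)

def check_ex7(a, b, c):
    """Check both product identities from ex. 7 for one triangle."""
    s, r, R = triangle_invariants(a, b, c)
    lhs1 = (2 * R - a) * (2 * R - b) * (2 * R - c)
    rhs1 = 2 * R * (2 * R + r - s) ** 2
    # t_a - r = s - a - r, and similarly for t_b, t_c
    lhs2 = (s - a - r) * (s - b - r) * (s - c - r)
    rhs2 = 2 * r ** 2 * (s - r - 2 * R)
    return (math.isclose(lhs1, rhs1, abs_tol=1e-9)
            and math.isclose(lhs2, rhs2, abs_tol=1e-9))

def has_obtuse_angle(a, b, c):
    """Criterion from ex. 7: an obtuse angle occurs iff r + 2R > s."""
    s, r, R = triangle_invariants(a, b, c)
    return r + 2 * R > s
```

For the right-angled triangle with sides 3, 4, 5 both products vanish, which matches the boundary case r + 2R = s.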


§1.6 


1. The circumcenter of the new triangle is the orthocenter of ABC. 

3. At the vertex where the right angle occurs. 

4. Because sin B = sin C. 

5. On B’E, take C so that GC = GB, and A so that AB’ = B'C. 

6. It is bsin C, and b = 2Rsin B. 

7. One-third of the altitude. 

8. If the Euler line passes through A, and if A is not a right angle, the altitude line
AH is a median. 

9. $R\cos A = \tfrac{2}{3}R\sin B\sin C$.


§1.7 
1. (a) One pair. (b) Three pairs.
2. If A > B > C, the order is EA''FC'B''DA'C''B'.
3. The angular measures of the relevant arcs of the nine-point circle (with A >
B > C, as in Figure 1.7a) are
$$A'E = A'F = 2A, \quad B'F = B'D = 2B, \quad C'D = C'E = 2C,$$
whence
$$DA' = 2(B - C), \quad A'B' = 2(\pi - A - B),$$
$$B'E = 2(2A + B - \pi), \quad EF = 2(\pi - 2A), \quad FC' = 2(2A + C - \pi).$$


4. The internal and external bisectors of an angle are perpendicular. 
5. The nine-point center of $I_aI_bI_c$ is the circumcenter of its orthic triangle ABC.
6. Each circumradius is twice the radius of the nine-point circle.


§1.8


1. U'V, which passes through W, is the image of UV by reflection in AC.

2. When there is a right angle at A, V and W coincide with A. When there is an
obtuse angle at A, this degenerate triangle UAA is still “better” than any proper 
triangle. 

3. The lines joining pairs of centers, being perpendicular to the common chords 
AP, BP, CP, make angles of 60° with one another, and thus form an equilateral 
triangle. 

4. At the Fermat point. (H. G. Forder uses analogous considerations to prove that,
if ABCD is a tetrahedron and PA + PB + PC + PD is a minimum, the angles APB
and CPD are equal and their bisectors lie on one line.)

5. They join pairs of villages to the ends of a short road in the middle. 

6. The “best” point for the ‘‘very obtuse’’ triangle is A itself. For the convex 
quadrangle it is the point of intersection of the diagonals. 

7. P is the incenter of P’BC. 

8. Let Z, X, U be the centers of the squares on three consecutive sides AB, BC, CD
of the parallelogram ABCD. The triangle XBZ is derived from XCU by a quarter-turn
(i.e., rotation through a right angle) about X.

9. Let M be the midpoint of CA. By ex. 8, the segments MZ and MX are congruent
and perpendicular. The same can obviously be said of MY and MA. Therefore the
triangle MAX is derived from MYZ by a quarter-turn about M.

10. As in ex. 9, the segments MZ and MX are congruent and perpendicular.
Similarly (by considering the triangle CDA instead of ABC), the segments MU and
MV are congruent and perpendicular. Therefore the triangle MXV is derived from
MZU by a quarter-turn about M.


§1.9 
1. These lines are the medians of the equilateral triangle PQR.
2. (i) $\alpha = \beta = \gamma = 40^\circ$; (ii) $\alpha = 30^\circ$, $\beta = \gamma = 45^\circ$.


3. Since $\angle CP_1Q = \angle CPQ = \gamma + \alpha = \angle QRA$, the circumcircle of AQR passes
through $P_1$, and likewise through $P_2$. Since
$$P_1Q = PQ = QR = RP = RP_2,$$
the points $P_1$, Q, R, $P_2$ are evenly spaced along this circle. In the special case, each of
the arcs $P_1Q$, QR, $RP_2$ subtends 20° at A, and the triangle AQR is isosceles.


§2.1 
ON, OD 1 


I. Si = = 
mee NiPy DPy V5’ 
ON, ON, I Vo =| 
= = = = cos 72°, 
OP, ON, +NiPy 14+ ¥5 4 
2.77 —3 =r 
§2.2 
If s is odd, $x^s + 1$ is divisible by $x + 1$, and therefore $2^{rs} + 1$ by $2^r + 1$.
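
The divisibility claim above is easy to test with exact integer arithmetic. This is only an illustrative Python sketch (the function name and the tested ranges are assumptions, not from the text): it checks that $2^r + 1$ divides $2^{rs} + 1$ for odd s, and shows that an even exponent s can break the divisibility.

```python
def divides(r, s):
    """True when 2**r + 1 divides 2**(r*s) + 1 (exact integer arithmetic)."""
    return (2 ** (r * s) + 1) % (2 ** r + 1) == 0

# Odd exponents s: divisibility holds for every r tried here.
odd_cases_ok = all(divides(r, s) for r in range(1, 8) for s in range(1, 12, 2))

# Even exponent: 2**2 + 1 = 5 is not divisible by 2**1 + 1 = 3.
even_counterexample = not divides(1, 2)
```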
§2.4 


1. Suppose a given isometry of period 2 interchanges A and A', and interchanges
B and B', where B does not lie on AA'. The midpoints of AA' and BB' are invariant
points. If they are distinct, the isometry is a reflection (by 2.31). If they coincide, it
is a half-turn.
2. (a) (i) $(x, y) \to (-x, -y)$; (ii) $(r, \theta) \to (r, \theta + 180^\circ)$.
(b) (i) $(x, y) \to (-y, x)$; (ii) $(r, \theta) \to (r, \theta + 90^\circ)$.


§2.5 

(i) $(r, \theta) \to (r, \theta + \alpha)$;

(ii) $(x, y) \to (x\cos\alpha - y\sin\alpha,\ x\sin\alpha + y\cos\alpha)$.
The transformed curve is $f(r, \theta - \alpha) = 0$.


§2.6 


1. If O is the center of a suitable one of the two squares that can be drawn on BC, 
the first quarter-turn is the product of reflections in CO and CB whereas the second is 
the product of reflections in BC and BO. 

2. P is transformed into A by a quarter-turn about C, and thence into S by a quarter- 
turn about B. 


§2.7 
I. (a) Cy, (b) Dy, (c) Dy, (d) Cp, 
(e) Dz, (f) De, (g) Dy. 
2. RTT = STT". 
3. R,R, = R,R,R,? = RR, RR, = RZR,R2; therefore R,? = 
and 
(Ri Ro)? = RiReRi* R.RiRe = (R,R2R,)? = 1. 


4. The periods of the elements of C,, are divisors of n. 


§2.8 


2. If the angles are all equal, sides of two different lengths can only occur alternately, 
and this is impossible if their number is odd. 

3. 108°, 36°, 140°, 100°, 20°. 

4. Circumradii/V4 + 2V2,/V2(V3 £1); inradii /(./2 + 1), (2 + V3); vertex 
figures /V2 + 1/2, (V3 + 1)/v2. 


2kn 
5. (. ) 
H 


6. Yes. Make every cut from the center. If the perimeter is divided into equal parts, 
the area is automatically divided into equal parts. 


§3.1 


1. Rotation, translation.

2. Reflection. Yes. 

3. If the perpendicular bisectors are distinct, find where they intersect. (If they are 
parallel, the segments are not related by a rotation!) If they coincide, find where the 
lines AB and A'B' intersect. If these lines also coincide, the center is the midpoint
of AA’. 

4. Rotate the first two mirrors until the second (in its new position) coincides with
the third. 




§3.2 

1. TO. 

2. It can be any line perpendicular to the direction of the translation. 

3. A translation. 

4. Translate the first two mirrors till the second (in its new position) coincides with 
the third. 

5. Any translation is expressible as the product of two half-turns, one of which may 
be arbitrarily assigned. Therefore, if $H_1$, $H_2$, $H_3$ are half-turns,
$$H_2H_3 = H_1H_4$$
for a suitable $H_4$; that is, $H_1H_2H_3 = H_4$.
7. $(x, y) \to (x + a, y)$. The transformed curve is $f(x - a, y) = 0$; for instance,
the unit circle with center (a, 0) is
$$(x - a)^2 + y^2 - 1 = 0.$$
§3.3 


1. (i) Half-turn about B, or reflection in the perpendicular line through B.
(ii) Translation from A to B, or a glide reflection. 

2. Two is the only even number less than or equal to 3. The product of a reflection 
and a half-turn is a reflection or a glide reflection according as the center of the half- 
turn does or does not lie on the mirror. 

3. Reflection in the perpendicular line through O. 

4. Half-turn. 

5. An opposite transformation. 

6. The relation $R_1R_2R_3 = R$ is equivalent to $R_1R_2 = RR_3$, which means that $R_1R_2$
and $RR_3$ are either equal rotations or equal translations.

7. $G^2 = TR_1R_1T = T^2$.

8. A glide reflection. An equation requires an expression for the old coordinates
in terms of the new.


§3.4 


(a) 0,0,0,0, is a parallelogram (possibly collapsing like OO’Q’Q in Figure 3.2a). 
(b) If $m_1$ and $m_2$ intersect, m bisects one of the angles between them. If instead they
are parallel, m is parallel to them and midway between them. 


§3.5 


1. S is a glide reflection.
2. If a translation commutes with a reflection, its direction must be along the mirror. 


§3.7 
1. (i), (ii), (iii), (iv), (v).
2. (iii), (v). 

§4.1 


1. Each side of a Dirichlet region joins the circumcenters of two congruent triangles
having a common side. 


§4.2 
1. Because two opposite vertices of each quadrangle are related by a translation.
2. Place the two parallelograms in such a position that a side of one is part of a side
of the other (with a common vertex at one end). 


§4.4 


No. A “Procrustean stretch” [Coxeter and Greitzer 1, p. 102], which doubles 
vertical distances while halving horizontal distances, can be applied to three of the 
patterns in Figure 4.4a, namely those illustrating the groups p2, cmm, pmg. 


§4.5 


1. Rotation through the same angle about P’. 
2. If T is a translation and S is a rotation whose period is greater than 2, then 
STS is a translation in a new direction. 


§4.6 


1. Since the vertex figure is regular, two adjacent faces are alike; therefore any two
faces are alike. Since the face is regular, two adjacent vertices are surrounded alike; 
therefore any two vertices are surrounded alike. 

3. No. To complete the lattice we would also need the centers of the hexagons. 


§4.7 


1. If Q were not between P, and P3, we could obtain another pair having a smaller 
distance than P,Q. 

2. Use induction over n, deriving a set of n - 1 points by omitting one of two whose
join contains no others. 

3. A complete quadrangle with its diagonal points. 


§5.1 
1. O(). 
2. It divides $O_1O_2$ in the ratio $(\lambda_2 - 1) : (\lambda_1 - 1)\lambda_2$.
3. (a) $(r, \theta) \to (\lambda r, \theta)$,
(b) $(x, y) \to (\lambda x, \lambda y)$.
4. By similar triangles, OP'/OP = OA'/OA.
5. By taking O between A and A'.
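
The ratio $(\lambda_2 - 1) : (\lambda_1 - 1)\lambda_2$ in ex. 2 can be checked on the line of centers with exact rational arithmetic. The sketch below is illustrative only: it assumes the product means the dilatation $O_1(\lambda_1)$ followed by $O_2(\lambda_2)$, that $\lambda_1\lambda_2 \ne 1$, and it uses hypothetical helper names with points represented as coordinates along the line $O_1O_2$.

```python
from fractions import Fraction

def dilate(center, ratio, p):
    """Image of the point p under the dilatation center(ratio), in one dimension."""
    return center + ratio * (p - center)

def invariant_point(o1, l1, o2, l2):
    """Fixed point of o1(l1) followed by o2(l2); requires l1 * l2 != 1."""
    # Solve p = o2 + l2 * (o1 + l1 * (p - o1) - o2), which is linear in p.
    return (o2 * (1 - l2) + l2 * o1 * (1 - l1)) / (1 - l1 * l2)

o1, o2 = Fraction(0), Fraction(1)
l1, l2 = Fraction(2), Fraction(3)
p = invariant_point(o1, l1, o2, l2)
ratio = (p - o1) / (o2 - p)                 # signed ratio in which p divides O1O2
expected = (l2 - 1) / ((l1 - 1) * l2)       # ratio stated in ex. 2
```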


§5.2 


2. If a common tangent TT' meets the line of centers in O, the dilatation O(OT'/OT)
transforms the first circle into the second.
3. It divides O O, in the ratio (A, — 1):(1 — A). 


§5.3 

et i 8 HG, 

2. The nine-point center is the same for all. 
§5.4 


Since any invariant point of a transformation is also an invariant point of the inverse
transformation, we lose no generality by considering a similarity ABC → A'B'C' in
which ABC is the larger of the two given similar triangles. (If it were the smaller, we
would alter the notation and consider the inverse similarity instead.) If A and A'
coincide, we have already found an invariant point. If not, suppose the similarity
transforms A' into A'', A'' into A''', and so on. Let $\mu$ denote the ratio of magnification,
so that $A'B' = \mu AB$, $0 < \mu < 1$. Then $A'A'' = \mu AA'$, $A''A''' = \mu A'A''$, and so on.
The circle with center A and radius $(1 - \mu)^{-1}AA'$ is transformed into the circle with
center A' and radius $\mu(1 - \mu)^{-1}AA'$. Since
$$1 + \frac{\mu}{1 - \mu} = \frac{1}{1 - \mu},$$
the former circle encloses the latter. Continuing, we obtain an infinite sequence of
circles whose radii $\mu^n(1 - \mu)^{-1}AA'$ tend to zero as n tends to infinity. Since these
circles are “nested,” their centers A, A', A'', ... converge to a point of accumulation O.
Since the similarity transforms AA'A'' ... into A'A''A''' ..., which is essentially the
same sequence, the point O is invariant.
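
The convergence of A, A', A'', ... can be watched numerically. The sketch below is an illustration under stated assumptions, not the book's method: it models a direct similarity of the plane as the complex map z → mz + t with |m| = μ < 1 (the names m, t and the function are hypothetical), iterates it, and compares the limit with the algebraic fixed point t/(1 - m).

```python
import cmath

def iterate_similarity(m, t, a, steps=200):
    """Apply z -> m*z + t repeatedly, starting from a; assumes |m| < 1."""
    z = a
    for _ in range(steps):
        z = m * z + t
    return z

m = 0.5 * cmath.exp(1j)      # ratio mu = 0.5, rotation through 1 radian
t = 1 + 2j
a = 0j                        # the point A; its image A' is m*a + t = t
limit = iterate_similarity(m, t, a)
fixed = t / (1 - m)           # algebraic invariant point
first_radius = abs(t - a) / (1 - abs(m))   # (1 - mu)^(-1) AA'
```

The limit lies inside the first circle of the nested sequence, as the argument above requires.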

§5.5 


1. A dilative rotation, possibly reducing to a dilatation or a rotation.

2. Let P be the point of intersection of the corresponding lines AB, A'B'. Let the
circles AA'P, BB'P, which have the common point P, meet again in O. The triangles
ABO, A'B'O (possibly collapsing into triads of collinear points) are easily seen to be
directly similar. Hence, this point O is the invariant point of the direct similarity
AB → A'B'.

If AA' and BB' are parallel, O coincides with P (and the two circles have a common
tangent at that point). In any other case, an alternative construction makes use of the
point T where AA' meets BB'. The circles ABT, A'B'T, which have the common point
T, meet again in O.

It follows that the four circles AA'P, BB'P, ABT, A'B'T all pass through one point
[Baker 1, p. 110].


§5.6 


1. This is the invariant point of a similarity. 

2. The segments AB and A'B' are related by a direct similarity and an opposite
similarity. The latter is a dilative reflection whose two axes divide each segment PP'
in the ratio AB : A'B' (one internally and the other externally; see Figure 5.6a). If
$A_1$ coincides with $B_1$, or $A_2$ with $B_2$, the direct similarity is a dilatation, and the
analogous points $P_1$ or $P_2$ (respectively) all coincide.

3. If S is a reflection, $S^2$ is the identity. If S is a glide reflection, $S^2$ is a translation,
which is one kind of dilatation. If S is a dilative reflection, $S^2$ is a central dilatation.

4. (a) A dilative rotation, possibly reducing to a dilatation or a rotation. (b) A
dilative reflection or a glide reflection, possibly reducing to a reflection.

5. As we saw in §5.4, the invariant lines of the dilative reflection meet in the
invariant point O and are the internal and external bisectors of $\angle AOA'$. By Euclid
VI.3 and its “external” analogue, they are the lines $OA_1$ and $OA_2$. The same reasoning
can be repeated using the B's instead of the A's.

6. (a) A dilative rotation. (b) A dilative reflection. 


§6.1 


2. Draw two circles with centers O, A and radius OA, meeting in C, C’. The circle 
with center C and radius CC' determines B on the circle OCC'.
3. Find the inverse of a point distant 24 from O and double its distance from O.
For a point whose distance from O lies between k/2n and k/(2n - 1), apply the
dilatation O(n), invert and then apply O(n) again.

4. To bisect OA, construct B as in ex. 2, and use three more circles to construct its
inverse in the circle with center O.

5. To divide OA into n equal parts, transform A by the dilatation O(n), and invert 
with respect to the circle with center O and radius OA. 


§6.3 


1. Compare Figure 6.3b with Figure 5.2a. In the case of equal intersecting circles,
one of the inversions is replaced by a reflection. 

2. Let Q denote the center of the rhombus APBP'. Then
$$OP \times OP' = (OQ - PQ)(OQ + PQ) = OQ^2 - PQ^2$$
$$= OQ^2 + AQ^2 - (AQ^2 + PQ^2)$$
$$= OA^2 - PA^2.$$

3. Let N be the midpoint of BD, and H the foot of the perpendicular from A on
the line BD. Suppose $AO = \mu AB$, so that $OP = \mu BD$ and $OP' = (1 - \mu)AC$. Then
$$BD \times AC = (HD - HB)(HD + HB) = HD^2 - HB^2 = AD^2 - AB^2$$
and
$$OP \times OP' = \mu BD \times (1 - \mu)AC = \mu(1 - \mu)(AD^2 - AB^2).$$

4. Let d be the distance from O to the center of $\gamma$. Comparing the diameter through
O of $\gamma$ with the corresponding diameter of the inverse circle, we see that the latter is
of length
$$\frac{k^2}{d - r} - \frac{k^2}{d + r} = \frac{2k^2 r}{d^2 - r^2} = \frac{2k^2 r}{p}.$$
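
Two of the computations in this section reduce to short arithmetic checks. The Python sketch below is illustrative, with hypothetical function names: the first function evaluates OP × OP' for Peaucellier's cell from the half-diagonal AQ of the rhombus (ex. 2), and the second evaluates the diameter of the inverse circle directly from the two inverted endpoints (ex. 4), assuming O lies outside the circle so that d > r.

```python
import math

def peaucellier_product(OA, PA, AQ):
    """OP * OP' for the rhombus APBP' with center Q, given OA, PA and AQ."""
    OQ = math.sqrt(OA ** 2 - AQ ** 2)   # right triangle OQA
    PQ = math.sqrt(PA ** 2 - AQ ** 2)   # right triangle PQA
    return (OQ - PQ) * (OQ + PQ)

def inverse_circle_diameter(k, d, r):
    """Diameter of the inverse of a circle (center at distance d, radius r),
    inverted in a circle of radius k centered at O; assumes d > r."""
    near = k ** 2 / (d - r)   # inverse of the endpoint nearest to O
    far = k ** 2 / (d + r)    # inverse of the endpoint farthest from O
    return near - far
```

The product in ex. 2 does not depend on AQ, which is what makes the linkage an inversor.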


§6.5 


1. Orthogonal circles invert into orthogonal circles, and any circle orthogonal to 
the circle of inversion inverts into itself. 

2. The two limiting points are the common points of any two members of the 
orthogonal pencil. 

3. Let $\alpha_1$, $\alpha_2$ be the two given circles, and $\beta_1$, $\beta_2$ any two circles that cut them both.
Let $l_{ij}$ be the radical axis of $\alpha_i$ and $\beta_j$. Let $P_j$ be the point where $l_{1j}$ meets $l_{2j}$. Then
$P_1P_2$ is the radical axis of $\alpha_1$ and $\alpha_2$.

4. One is the center and the other is the point at infinity. 

5. Invert the whole figure in a circle whose center is one of the points of contact.
Two circles of the ring become parallel lines, say a and b. The rest have their centers
and points of contact on a line l, perpendicular to a and b. The original circles become
equal circles both touching a and b. The line l serves as a mirror reflecting these two
circles into each other. The inversion transforms this reflection into an inversion.

The centers of the circles lie on an ellipse.

6. Inverting in a circle whose center is the point of contact of the tangent circles
with centers A and B, we obtain two parallel lines $E_1$, $E_2$, and a circle $E_3$ sandwiched
between them (as in the first part of the answer to ex. 9 of §1.5). In this very simple
case, Soddy's circles (in the same sandwich) are congruent to $E_3$ and tangent to it on
opposite sides. Their radical axis $H_4$, joining the points of contact of $E_3$ with $E_1$ and
$E_2$, is the inverse of the incircle.


§6.6


1. Each circle of Apollonius is orthogonal to all the circles through A and A'.

2. With its center where the perpendicular bisector of AA' meets l, draw a circle
through A (and A'), meeting l in $P_1$ (near A) and $P_2$ (near A'). Consider the value of
the ratio
$$\mu = \frac{A'P}{AP}$$
for various positions of P. Since $P_1P_2$ is a diameter of the circle drawn through A
and A', two of the circles of Apollonius touch l at $P_1$ and $P_2$ respectively. Let $\mu_1$ and
$\mu_2$ be the values of $\mu$ on these two circles.

On all circles of Apollonius inside the one through $P_1$, we have $\mu > \mu_1$, and on all
circles outside, $\mu < \mu_1$. Therefore, among the various positions of P on l, $P_1$ has the
maximal $\mu$, namely $\mu = \mu_1$. Similarly, the circle of Apollonius through $P_2$ has $\mu < \mu_2$
inside and $\mu > \mu_2$ outside; therefore, among all positions of P on l, $P_2$ has the minimal
$\mu$, namely $\mu = \mu_2$.

3. v/G — 4). 

4, Since O and O are the common points of two circles of Apollonius, we have 


$$\frac{OA'}{OA} = \frac{OB'}{OB} = \frac{A'B'}{AB},$$


and the same with O replaced by O. If A’ = B, use ex. 3 on p. 76. 

5. Inverting in a circle whose center is either of the limiting points of the coaxal
pencil, we obtain three concentric circles whose radii satisfy either $a_1 < a_2 < a_3$ or
$a_1 > a_2 > a_3$. Choosing the limiting point that yields the former order, we find
$$(\alpha_1, \alpha_2) + (\alpha_2, \alpha_3) = \log\frac{a_2}{a_1} + \log\frac{a_3}{a_2} = \log\frac{a_3}{a_1} = (\alpha_1, \alpha_3).$$


For further details see Coxeter and Greitzer 1, pp. 123-131. 

6. The circle of similitude is a circle of Apollonius, namely, the locus of a point 
whose distances from the centers of the two given circles are proportional to their 
radii, say OA : OA' = r : r'. The direct or opposite similarity that transforms OA into
OA’ also transforms the given circle with center A into the given circle with center 
A’. It follows that the locus of points from which the two given circles subtend equal 
angles is their circle of similitude or, if the two circles intersect, it is the part of their 
circle of similitude that lies outside them. 

For two equal circles, the locus reduces to their radical axis. 

7. Let the given circles have centers A, B and radii a, b. Let P, Q be the inverses
of a point W on the circle of similitude. Then
$$\frac{AP \times AW}{BQ \times BW} = \frac{a^2}{b^2} = \left(\frac{a}{b}\right)^2 = \left(\frac{AW}{BW}\right)^2,$$
$$\frac{AP}{BQ} = \frac{AW}{BW},$$
and PQ is parallel to AB. Since both the given circles are orthogonal to the circle
WPQ, their radical axis is a diameter of the latter, namely the diameter perpendicular
to AB. Since this diameter is also perpendicular to PQ, P and Q are images of each
other by reflection in it.
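
The additivity of inversive distance in ex. 5 can be tested without performing the inversion. The sketch below is an illustration under assumptions not stated in the answer: it uses the standard formula $\cosh\delta = |d^2 - a^2 - b^2| / 2ab$ for two non-intersecting circles with radii a, b and center distance d, and it takes three circles from the non-intersecting pencil whose limiting points are (±1, 0). The function names are hypothetical.

```python
import math

def inversive_distance(c1, r1, c2, r2):
    """Inversive distance of two non-intersecting circles (centers as (x, y))."""
    d2 = (c1[0] - c2[0]) ** 2 + (c1[1] - c2[1]) ** 2
    return math.acosh(abs(d2 - r1 ** 2 - r2 ** 2) / (2 * r1 * r2))

def coaxal_circle(c):
    """Member of the pencil with limiting points (1, 0) and (-1, 0):
    center (c, 0) with c > 1, radius sqrt(c^2 - 1)."""
    return (c, 0.0), math.sqrt(c * c - 1)

# Three nested circles of the pencil, each enclosing the previous one.
g1, g2, g3 = coaxal_circle(1.25), coaxal_circle(2.0), coaxal_circle(5.0)
d12 = inversive_distance(*g1, *g2)
d23 = inversive_distance(*g2, *g3)
d13 = inversive_distance(*g1, *g3)
```

For concentric circles the same formula gives the logarithm of the ratio of the radii, which is the form used in the answer.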


§6.7 

J,S =S+S7J,S = SJ',, where J’; is the inversion in the circle with center O and 
radius k. The two inversions are the same if and only if the isometry S leaves O 
invariant, that is, if and only if T interchanges O and O’. 


§6.8 
1. OA × OA' = OB × OB' and $\angle AOB = \angle A'OB'$.
2. The ratio of magnification is
$$\frac{OA'}{OB} = \frac{OA \times OA'}{OA \times OB} = \frac{k^2}{ab}.$$

3. Let a = OA, b = OB, c = OC, d = OD. Then
$$\frac{A'B' \times C'D'}{A'C' \times B'D'} = \frac{(k^2/ab)AB \times (k^2/cd)CD}{(k^2/ac)AC \times (k^2/bd)BD} = \frac{AB \times CD}{AC \times BD}.$$

4. Spheres through O invert into planes. Two spheres that touch each other at O
have no other common point. Two planes that have no common point are parallel.

5. After inversion we have a sphere $\gamma$ “sandwiched” between two parallel planes
$\alpha$ and $\beta$. All the spheres $\sigma_1$, $\sigma_2$, ... are congruent to $\gamma$. The section of the figure by
the plane midway between $\alpha$ and $\beta$ is a circle touching a ring of six congruent circles.
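
The distance formula of ex. 2 and the invariance of the cross ratio in ex. 3 can be confirmed numerically. This is a minimal sketch under an assumed representation: points of the plane are complex numbers, the circle of inversion is centered at the origin O with radius k, and the helper names are illustrative.

```python
import math

def invert(z, k):
    """Inverse of the point z in the circle with center 0 and radius k."""
    return k * k / z.conjugate()

def cross_ratio(a, b, c, d):
    """The ratio (AB x CD) / (AC x BD) of ex. 3, using plain distances."""
    return abs(a - b) * abs(c - d) / (abs(a - c) * abs(b - d))

k = 2.0
A, B, C, D = 1 + 2j, -3 + 1j, 2 - 1j, 0.5 + 0.5j
A1, B1, C1, D1 = (invert(z, k) for z in (A, B, C, D))
```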


§6.9 

I. Consider, for instance, two circles of radius 4a whose centers are distant $7. 
When each circle is represented by a pair of parallel small circles on a sphere, the 
points of intersection are the vertices of 2 squares. (The common radius of the circles
may be taken to have any value between 7 and $7. The result is most evident when 


this value is nearly $7.) 
2. Prove this first for a triangle and then dissect the p-gon into triangles. 


§7.1 

Let R denote the reflection in the plane of the two lines, $R_1$ and $R_2$ the reflections
in the planes through the respective lines perpendicular to that plane. Then the
half-turns may be expressed as $R_1R$ and $RR_2$, so that their product is $R_1R_2$.


§7.2 
The identity. 


§7.3 
The reflection in another parallel plane. 


§7.4 


1. The reflection in another plane through the same line. 
2. The two tetrahedra OABC and OA'B'C', being congruent, are related either by a
rotation or by a rotatory-inversion. In the former case any point on the axis of rotation
would be, like O, equidistant from A and A', from B and B', and from C and C'.


§7.5 
1. (a) Reflection, (b) Quarter-turn, (c) Translation, 
(d) Twist, (e) Glide reflection, (f) Rotatory inversion. 
2. In the notation of Figure 7.5a, the half-turns are R’,R’s, R’3R’,, and their product 


is the twist R’,R’s* R’ R's. 


§7.6 


1. It is transformed into
$$(\mu x\cos\alpha - \mu y\sin\alpha,\ \mu x\sin\alpha + \mu y\cos\alpha,\ \mu z).$$


2. Axis x = y = z. Angle $2\pi/3$.
3. This is a dilative rotation with angle $\pi$ and ratio $-\lambda$.
4. Yes. Simply use spheres instead of circles. The same proof is applicable in a 


space of any number of dimensions. 


§7.7 

An isometry is the product of four or fewer reflections. Any other similarity is the 
product of a rotation and a dilatation. If the ratio of magnification happens to be 
negative, we can use instead a rotatory inversion and a direct dilatation. Since a 
direct dilatation is the product of inversions in two concentric spheres, this makes, 


altogether, two or three reflections and two inversions. 
Finally, the product of an inversion and an isometry is the product of an inversion 


and r reflections, r < 4. 


§8.1 
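3. If $P_i$ is $(x_i, y_i)$, $M_{12}$ is
$$\left(\frac{x_1 + x_2}{2}, \frac{y_1 + y_2}{2}\right)$$
and the midpoint of $M_{12}M_{34}$ is
$$\left(\frac{x_1 + x_2 + x_3 + x_4}{4}, \frac{y_1 + y_2 + y_3 + y_4}{4}\right).$$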

§8.2 

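1. $r_1^2 + r_2^2 - 2r_1r_2\cos(\theta_2 - \theta_1)$.

2. $(r, \theta)$, where
$$r^2 = \tfrac{1}{4}[r_1^2 + r_2^2 + 2r_1r_2\cos(\theta_1 - \theta_2)]$$
and
$$\tan\theta = \frac{r_1\sin\theta_1 + r_2\sin\theta_2}{r_1\cos\theta_1 + r_2\cos\theta_2}.$$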

3.6=4, 

4. The respectively parallel lines through the origin are ax + by =0 and a’x + 
b'y = 0, or
The condition is derived from 8.22 by writing it in the form
, 


; that is, —l. 


eal bb 
5. Replacing x and y by $x\cos\alpha - y\sin\alpha$ and $x\sin\alpha + y\cos\alpha$, where
$\alpha = \tfrac{1}{2}\arctan(-\tfrac{24}{7}) = \arctan(-\tfrac{3}{4})$, we obtain
$$4(4x + 3y)^2 + 24(4x + 3y)(-3x + 4y) + 11(-3x + 4y)^2 = 125,$$
which reduces to
$$-x^2 + 4y^2 = 1.$$
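
The reduction in ex. 5 can be verified with exact rational arithmetic. The sketch below is illustrative only. It assumes, from the coefficients of the displayed equation, that the quadratic form being rotated is $4X^2 + 24XY + 11Y^2$ with right-hand side 125/25 = 5, and it uses $\cos\alpha = 4/5$, $\sin\alpha = -3/5$, which corresponds to $\alpha = \arctan(-3/4)$. The function names are hypothetical.

```python
from fractions import Fraction

def quadratic_form(X, Y):
    """The form 4X^2 + 24XY + 11Y^2 inferred from the displayed equation."""
    return 4 * X * X + 24 * X * Y + 11 * Y * Y

def rotate(x, y):
    """Substitution x cos a - y sin a, x sin a + y cos a with cos a = 4/5, sin a = -3/5."""
    return (4 * x + 3 * y) / 5, (-3 * x + 4 * y) / 5

def reduced_form(x, y):
    """Five times the left side of the reduced equation -x^2 + 4y^2 = 1."""
    return 5 * (-x * x + 4 * y * y)

samples = [(Fraction(3), Fraction(2)), (Fraction(-1, 2), Fraction(7, 3)), (Fraction(0), Fraction(1))]
agree = all(quadratic_form(*rotate(x, y)) == reduced_form(x, y) for x, y in samples)
```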

§8.3 

1. $c(x^2 + y^2) + (ax + by) = 0$,

$c(x^2 + y^2) + 2k^2(gx + fy) + k^4 = 0$.
2. $x^2 + y^2 = k^2$.
3. $(x^2 + y^2)^2 = 2a^2(x^2 - y^2)$,
$r^2 = 2a^2\cos 2\theta$.
6. From
$$r\cos\theta + b = 2b\cos t - b\cos 2t = 2b\cos t\,(1 - \cos t) + b$$
and
$$r\sin\theta = 2b\sin t - b\sin 2t = 2b\sin t\,(1 - \cos t)$$
we deduce $r^2 = (r\cos\theta)^2 + (r\sin\theta)^2 = 4b^2(1 - \cos t)^2$ and $\tan\theta = \tan t$. When
$\theta$ is replaced by $\theta + \pi$, r changes from $2b(1 - \cos\theta)$ to $2b(1 + \cos\theta)$. The sum of these
distances is 4b, for all values of $\theta$.


§8.4 


1. A parabola. 

2. A hyperbola. 

3. The half-turn about the center yields a second focus and a second directrix for 
any central conic. For the ellipse, the two foci lie between the two directrices; for the 
hyperbola, the foci lie beyond the directrices. Let O and O’ denote the foci and K’ 
the foot of the perpendicular from P to the second directrix. Since 


$$OP = \epsilon PK \quad \text{and} \quad O'P = \epsilon PK'$$
we have, for the ellipse,
$$OP + O'P = \epsilon(PK + PK') = \epsilon KK',$$
and for the hyperbola,
$$O'P - OP = \epsilon(PK' - PK) = \epsilon KK'.$$
4.¢= V1 = Ba, V2. 
5. Since the circumcenter must be equidistant from A and C, we have 
w+ GyP = 1 + Gy’, 
e+ dP =i. 


6. By the theory of quadratic equations, F is the product of two linear forms if it is 
indefinite, and a perfect square if it is semidefinite.
7. 2ay = a’.

8. For each point P on the ellipse there is a corresponding point P’ on the auxiliary 
circle whose diameter is the major axis, PP’ being perpendicular to that axis. The 
radius through P’ makes an angle ¢ with the major axis. 

9. Both branches are included. 

10. Replace r by P/r. 


§8.5 
1. The parabola $x = 2lt^2$, $y = 2lt$ meets the line $Xx + Yy + Z = 0$ in points given
by the roots of the quadratic equation
$$X \cdot 2lt^2 + Y \cdot 2lt + Z = 0.$$
The sum and product of the roots, say t and t', are
$$t + t' = -\frac{Y}{X}, \quad tt' = \frac{Z}{2lX}.$$

2. The secant of the ellipse $x = a\cos t$, $y = b\sin t$ is
$$\begin{vmatrix} x & y & 1 \\ a\cos(\alpha + \beta) & b\sin(\alpha + \beta) & 1 \\ a\cos(\alpha - \beta) & b\sin(\alpha - \beta) & 1 \end{vmatrix} = 0.$$
For the hyperbola, the tangent is
$$\frac{x}{a}\cosh t - \frac{y}{b}\sinh t = 1.$$

3. The envelope of the line $Xx + Yy + Z = 0$, whose coefficients X, Y, Z are
functions of a parameter t, is the locus of its point of intersection with
$$(X + dX)x + (Y + dY)y + (Z + dZ) = 0,$$
or with $X'x + Y'y + Z' = 0$, where $X' = \partial X/\partial t$, etc. Differentiating
$$ax\sec t - by\csc t = a^2 - b^2$$
and dividing by $\sin t\cos t$, we obtain
$$\frac{ax}{\cos^3 t} = \frac{-by}{\sin^3 t} = \frac{ax\sec t - by\csc t}{\cos^2 t + \sin^2 t} = a^2 - b^2.$$
Thus the envelope of normals is the locus of (x, y) where
$$\frac{ax}{a^2 - b^2} = \cos^3 t, \quad \frac{by}{a^2 - b^2} = -\sin^3 t.$$
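
The envelope found in ex. 3 can be checked by substituting the parametric point back into the normal and into its derivative with respect to t. The following is a small numerical sketch with hypothetical function names; it assumes sin t and cos t are both nonzero.

```python
import math

def evolute_point(a, b, t):
    """Point (x, y) with ax = (a^2 - b^2) cos^3 t and by = -(a^2 - b^2) sin^3 t."""
    c = a * a - b * b
    return c * math.cos(t) ** 3 / a, -c * math.sin(t) ** 3 / b

def normal_residual(a, b, t, x, y):
    """Left side minus right side of the normal ax sec t - by csc t = a^2 - b^2."""
    return a * x / math.cos(t) - b * y / math.sin(t) - (a * a - b * b)

def derivative_residual(a, b, t, x, y):
    """Derivative in t of ax sec t - by csc t, which must vanish on the envelope."""
    return (a * x * math.sin(t) / math.cos(t) ** 2
            + b * y * math.cos(t) / math.sin(t) ** 2)
```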

§8.6 

1. $\pi ab$.

2. d1ab. 
§8.7 


1. $a^2/r = a\mu^\theta$. This inversion has the same effect as reflection in the initial line.
[Drawings (i) and (ii): symmetrical loxodromes]
These drawings were made by Ryszard Krasnodebski. See also Coxeter, Mathematical
Gazette, 52 (1968), p. 5; Aequationes Mathematicae, 1 (1968), pp. 112-114.


§8.8 
1. Eliminate X : Y : Z : T from the four equations
$$Xx_i + Yy_i + Zz_i = T \quad \text{and} \quad Xx + Yy + Zz = T.$$
The effect of replacing the ith point by a direction is the same as the effect of replacing
it by $(X_i t, Y_i t, Z_i t)$, so that the ith row becomes
$$X_i t \quad Y_i t \quad Z_i t \quad 1$$
or (equally well)
$$X_i \quad Y_i \quad Z_i \quad 1/t,$$
and then making 1/t tend to zero.
5. The condition for the radii to any common point (x, y, z) of the two spheres to
be perpendicular is
$$(x + u)(x + u') + (y + v)(y + v') + (z + w)(z + w') = 0.$$
The desired condition is obtained by doubling this and subtracting the equations of
the spheres.
6. The polar plane of (X, Y, Z) passes through (X’, Y’, Z’) if 
XX'+ YY'+ZZ’ =k. 

The symmetrical nature of this condition shows that then the polar plane of (X', Y', Z')
passes through (X, Y, Z). (Two such points are said to be conjugate with respect to
the sphere.) The special case asked for arises when (X', Y', Z') lies on the sphere and
(X, Y, Z) lies in the tangent plane at that point.
7. Factorizing both sides of the equation


v Yee 7 zP ¥ c. 
—cos%* +-sinaw|— [-]= —|[—cose« —~-sin«]+ 1, 
a b C) b a 


we see that, for each value of «, every point on the line 


£ ¥y. fe ¥ en 

-cos*+-sne*=-, —~cos% —-Ssina =] 

a b c b a 
lies on the hyperboloid. Reversing the sign of Z, we obtain the other system of generators. 
The general generator of the first system meets the special generator 


x z y 
a ae ak 
of the second where 
z z 
Simo 7 (Ui cog a), ¢ sina = 1 — cos a. 


The consistency of these equations shows that generators of opposite systems intersect 
(or else, if « = 7, are parallel). On the other hand, any common point of the general 
generator of the first system and the special generator 


“2 Yy 1 


ac’ b 


of the same system would have to satisfy both the equations 
. 2 @ . 
sin « =-(1 — cos «), —-sinea = 1 —cosa, 
c c 


which can happen only when the two generators coincide.


§9.3 
loc =2 ti. 
2. u + vi = 0 means that the point (u, ») coincides with the origin (0, 0). 
3. $(a + bi)^{-1} = x + yi$, where
$$x = \frac{a}{a^2 + b^2}, \quad y = \frac{-b}{a^2 + b^2}.$$


4. (i) The dilative rotation reduces to a dilatation, and the two shaded triangles are 


homothetic. 
(ii) The dilative rotation reduces to a rotation, and the two shaded triangles are 


congruent. 


§9.4 


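(a) $3 + 4i = 5(\cos\alpha + i\sin\alpha)$, where $\cos\alpha = \tfrac{3}{5}$, so that $\cos\tfrac{1}{2}\alpha = \sqrt{\tfrac{4}{5}}$ and
$\sin\tfrac{1}{2}\alpha = \sqrt{\tfrac{1}{5}}$. Hence one square root is
$$(3 + 4i)^{1/2} = 5^{1/2}(\cos\tfrac{1}{2}\alpha + i\sin\tfrac{1}{2}\alpha) = 2 + i$$
and the other is $-(2 + i) = -2 - i$.
(b) $1 = \cos 0 + i\sin 0 = \cos 2\pi + i\sin 2\pi = \cos 4\pi + i\sin 4\pi$. Hence the three
cube roots are
$$\cos 0 + i\sin 0 = 1,$$
$$\cos\tfrac{2}{3}\pi + i\sin\tfrac{2}{3}\pi = \tfrac{1}{2}(-1 + i\sqrt{3}),$$
$$\cos\tfrac{4}{3}\pi + i\sin\tfrac{4}{3}\pi = \tfrac{1}{2}(-1 - i\sqrt{3}).$$
(c) $\pm 1$, $\pm\omega$, $\pm\omega^2$.
(d) The same and also $\pm i$, $\pm i\omega$, $\pm i\omega^2$.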
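
The polar-form method used in (a) and (b) translates directly into code. The sketch below is an illustration using Python's standard `cmath` module; the function name `nth_roots` is hypothetical. It divides the argument by n and steps through the n equally spaced values.

```python
import cmath
import math

def nth_roots(z, n):
    """All n-th roots of z, from its modulus and argument."""
    r, phi = cmath.polar(z)
    return [cmath.rect(r ** (1 / n), (phi + 2 * math.pi * k) / n) for k in range(n)]

square_roots = nth_roots(3 + 4j, 2)   # expected near 2 + i and -2 - i
cube_roots = nth_roots(1, 3)          # expected near 1, omega, omega^2
omega = complex(-0.5, math.sqrt(3) / 2)
```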


§9.5 


1. $e^{\frac{1}{2}\pi i} = \cos\tfrac{1}{2}\pi + i\sin\tfrac{1}{2}\pi = i$. Yes.


§9.6 
By Pythagoras, 


§9.7 


1. The angle is am a 
2. The angle is am a. 


§10.1 

1. The octahedron is a square dipyramid with equilateral side faces. 

2. A triangular dipyramid. 

3. (i) A square, (ii) a hexagon, (iii) a decagon. 

4. The cutting plane, for either acute corner, passes through the midpoints of the 
three edges that meet there. The rhombohedron, like any parallelepiped, can be 
repeated by translations to fill the whole space without interstices. 


§10.2 


1. The bases appear as two pentagons: a large one with a small one oppositely 
placed inside. 

2. Seven. 

3. Eight. 


§10.3 


1. Use 10.32. 
2. If a polyhedron has $p \ge 4$ for every face and $q \ge 4$ for every vertex,
$$4(V - E + F) \le \sum q - 4E + \sum p = 2E - 4E + 2E = 0.$$


3. The only possibility is a tessellation of rhombi whose vertices form a lattice. 


§10.5 


1. The edges at a vertex are mutually orthogonal and of equal length.
2. One vertex in each “octant.”
3. (1, 1, 1).
4. From the cube in ex. 1 we derive the tetrahedron
(0, 0, 0)(0, 1, 1)(1, 0, 1)(1, 1, 0),
with face planes x + y + z = 2, x = y + z, y = z + x, z = x + y. To normalize
these equations we divide by $\sqrt{3}$. The cosine of the internal angle between two of the
planes is $\tfrac{1}{3}$.

5. $(\pm 1, 0, 0)$, $(0, \pm 1, 0)$, $(0, 0, \pm 1)$. The face planes are $\pm x \pm y \pm z = 1$. The
edge joining (1, 0, 0) and (0, 1, 0) belongs to the planes $x + y \pm z = 1$, which make
an angle whose cosine is $-\tfrac{1}{3}$.

6. Points between the parallel planes $x + y + z = \pm 1$ satisfy $x + y + z < 1$ and
$-x - y - z < 1$; similarly for the other pairs.

7. 120°. The regular tetrahedron
$$(-1, -1, -1)(-1, 1, 1)(1, -1, 1)(1, 1, -1)$$
has its center at the origin. The planes joining the origin to pairs of vertices are
$$y \pm z = 0, \quad z \pm x = 0, \quad x \pm y = 0.$$
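
The cosines quoted in ex. 4 and ex. 5 follow from the normals of the face planes. The sketch below is illustrative, with hypothetical function names. It assumes each plane is described by its outward normal, so that the internal dihedral angle is the supplement of the angle between the two normals.

```python
import math

def cos_between(n1, n2):
    """Cosine of the angle between two normal vectors."""
    dot = sum(p * q for p, q in zip(n1, n2))
    return dot / math.sqrt(sum(p * p for p in n1) * sum(q * q for q in n2))

def internal_dihedral_cos(out1, out2):
    """Cosine of the internal dihedral angle, given outward normals."""
    return -cos_between(out1, out2)

# Tetrahedron of ex. 4: faces x + y + z = 2 and z = x + y, outward normals.
tetra = internal_dihedral_cos((1, 1, 1), (-1, -1, 1))
# Octahedron of ex. 5: faces x + y + z = 1 and x + y - z = 1, outward normals.
octa = internal_dihedral_cos((1, 1, 1), (1, 1, -1))
# Ex. 7: the planes y + z = 0 and z + x = 0 meet at 60 degrees, i.e. 120 degrees.
planes = cos_between((0, 1, 1), (1, 0, 1))
```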


§11.1 


Oo. _ 20 7. oT 
I. rsing =sin-> =2cosgsing. 


2. Center S, radius QU. 


§11.2 
1. The four points $(\pm\tau, \pm 1, 0)$ evidently form a golden rectangle in the plane z = 0.
2. The segment $(0, 0, \tau^2)(0, \tau^2, 0)$ is divided in the ratio $\tau : 1$ by the point $(0, \tau, 1)$.
3. $(0, \pm\tau^{-1}, \pm\tau)$, $(\pm\tau, 0, \pm\tau^{-1})$, $(\pm\tau^{-1}, \pm\tau, 0)$, $(\pm 1, \pm 1, \pm 1)$. Thus the 20
vertices belong to 3 “doubly golden” rectangles (whose sides are in the ratio $\tau^2 : 1$) and
a cube.


§11.3 


1. Corresponding sides of the two rectangles meet in the points B, D, F, H. The 
lines BF and DH meet in O. 

2. The points I, G, C, A, having polar coordinates (τ⁻², −π), (τ⁻¹, −½π), (τ, ½π),
(τ², π), have Cartesian coordinates (−τ⁻², 0), (0, −τ⁻¹), (0, τ), (−τ², 0). Hence the
lines IC, GA are

\[ \tau^3 x - y + \tau = 0, \qquad x + \tau^3 y + \tau^2 = 0; \]

and H, their point of intersection, is (−½, −½) or, in polar coordinates, (√½, −¾π).
Thus the points J, H, F, D, B, given by

\[ r = 2^{-1/2}\tau^{n}, \qquad \theta = \tfrac14(2n - 3)\pi \qquad (n = -1, 0, 1, 2, 3), \]

lie on the spiral 8.71, where a = 2^(−1/2) τ^(3/2). This is derived from the spiral r = μ^θ by the
dilatation O(a) or by rotation through the angle

\[ \frac{\log a}{\log \mu}. \]
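The intersection point and the spiral in ex. 2 can be checked numerically. The following is only a sketch: it assumes that the two lines are τ³x − y + τ = 0 and x + τ³y + τ² = 0, and that the spiral through A, C, G, I is r = τ^(2θ/π); the variable names are illustrative and not part of the text.

```python
import math

tau = (1 + math.sqrt(5)) / 2

# Lines a1*x + b1*y + c1 = 0 and a2*x + b2*y + c2 = 0 (assumed forms of IC and GA).
a1, b1, c1 = tau ** 3, -1.0, tau
a2, b2, c2 = 1.0, tau ** 3, tau ** 2

# Cramer's rule for the common point H.
det = a1 * b2 - a2 * b1
hx = (b1 * c2 - b2 * c1) / det
hy = (c1 * a2 - c2 * a1) / det

# The five points J, H, F, D, B in polar form, for n = -1, 0, 1, 2, 3.
a = 2 ** -0.5 * tau ** 1.5
spiral_points = [(tau ** n / math.sqrt(2), (2 * n - 3) * math.pi / 4)
                 for n in range(-1, 4)]

# Largest deviation of these points from the dilated spiral r = a * tau**(2*theta/pi).
max_deviation = max(abs(r - a * tau ** (2 * theta / math.pi))
                    for r, theta in spiral_points)
```

Under these assumptions `hx` and `hy` both come out as −½ up to rounding, and `max_deviation` is at the level of floating-point error.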
§11.4
ls fog hs, 
2. Using induction, assume $f_{n-2}f_n - f_{n-1}^2 = (-1)^{n-1}$. Then
\[ f_{n-1}f_{n+1} - f_n^2 = f_{n-1}(f_{n-1} + f_n) - f_n(f_{n-2} + f_{n-1}) = f_{n-1}^2 - f_{n-2}f_n = (-1)^n. \]
3. Working modulo 10, we have 5 + 8 = 3,8 +3 =1, and so on. 
4. Setting k + j = n, we see that the coefficient of rⁿ is

\[ \sum \binom{k}{j}, \]

where k = n − j and, since 2j ≤ j + k = n, 0 ≤ j ≤ ½n.
5. 1.010203050813213455 .... 


6. By 11.48,
\[ \frac{f_{n+1}}{f_n} = \frac{\tau^{n+1} - (-\tau)^{-n-1}}{\tau^{n} - (-\tau)^{-n}} = \frac{\tau + (-1)^{n}\tau^{-2n-1}}{1 - (-1)^{n}\tau^{-2n}}. \]
Hence the limit is τ.
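The Fibonacci answers above (the identity of ex. 2, the final digits of ex. 3, and the limit of ex. 6) lend themselves to a quick numerical check. This is a minimal sketch, assuming the usual indexing f₀ = 0, f₁ = f₂ = 1; the helper name `fib` is illustrative.

```python
def fib(n_max):
    """Return the list [f_0, f_1, ..., f_n_max] with f_0 = 0 and f_1 = 1 (assumed indexing)."""
    f = [0, 1]
    while len(f) <= n_max:
        f.append(f[-1] + f[-2])
    return f

f = fib(40)

# Ex. 2: f_{n-1} f_{n+1} - f_n^2 = (-1)^n for every n >= 1 in range.
cassini_ok = all(f[n - 1] * f[n + 1] - f[n] ** 2 == (-1) ** n
                 for n in range(1, 40))

# Ex. 3: the final digits obey the same recurrence modulo 10.
last_digits = [x % 10 for x in f[1:9]]

# Ex. 6: the ratio of consecutive terms approaches tau = (1 + sqrt 5)/2.
tau = (1 + 5 ** 0.5) / 2
ratio_error = abs(f[40] / f[39] - tau)
```

The list `last_digits` begins 1, 1, 2, 3, 5, 8, 3, 1, in agreement with 5 + 8 ≡ 3 and 8 + 3 ≡ 1 modulo 10.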
§11.5 
When k = 1, 11.51 yields h = τ^(−3/2) = 0.48587 . . . .
§12.1 
1. Both. 


2. The sum of the three angles of a triangle is equal to two right angles. 
3. Affine geometry. 
4. (a) Affine, (b) absolute, (c) absolute. 


§12.2 

1. By Axiom 12.22, there is a point C with [ABC], also a point D with [BCD], and 
so on forever. 

2. Theorem 12.271. 

3. If [FDE], we could apply 12.27 to the triangle BFD with [DCB], obtaining Z on 
EC with [BZF]. Since Z = A, this contradicts [AFB]. 

4. Any line not belonging to the set contains an infinite number of points, among 
which only a finite number can lie on lines of the set (at most one on each). 

5. Use 12.278 (and Fig. 12.2d) with D, A, B, F, C, L replaced by A, B, C, L, M,N. 

6. Take A′ on A/B, B′ on B/C, and apply Axiom 12.27 to the triangle A′B′B with
[B′BC] and [BAA′].

7. For any such A′ and B′, the line A′B′ meets A/C; therefore it does not meet C/A.


§12.3 

The n − 1 points Pᵢ (i > 1) are joined in pairs by at most $\binom{n-1}{2}$ lines, some or all of
which may meet P,Q. In Figure 12.30, the six joins P,P,, PyP , Py P2, P3P¢, PePi, PiPs 
all make the same contribution as P,P;. 


§12.4 


1. Since the five points are not collinear, they must form either a convex pentagon 
or a convex quadrangle with one point inside, or a triangle with two points inside. In 
the first and second cases the result is obvious. In the third case the two inner points
lie on a line meeting two distinct sides of the triangle. The ends of the third side form, 
with these inner points, the desired convex quadrangle. 

4. The first two lines, BC and CA, decompose the plane into four angular regions. 
The line AB has no intersection with the region bounded by C/A and C/B, but it 
decomposes each of the remaining three angular regions into two parts. The region 
bounded by the triangle is the only finite part, since at least one side of each of the 
others is a ray. 

5. Consider any m − 1 of the m lines, and the f(2, m − 1) regions formed by them.
They decompose the mth line into m parts (namely, two rays and m − 2 segments),
lying respectively in m of the f(2, m − 1) regions. These m regions are each decomposed
into two, whereas the rest are unaffected. Hence

\[ f(2, m) = f(2, m - 1) + m. \]

Combining this with the analogous equations

\[ f(2, m - 1) = f(2, m - 2) + m - 1, \]
\[ \cdots \]
\[ f(2, 1) = f(2, 0) + 1, \]

we obtain

\[ \begin{aligned}
f(2, m) &= f(2, m - 1) + m \\
        &= f(2, m - 2) + (m - 1) + m \\
        &\;\;\vdots \\
        &= f(2, 0) + 1 + 2 + \cdots + m \\
        &= 1 + \binom{m+1}{2} = \binom{m}{0} + \binom{m}{1} + \binom{m}{2}.
\end{aligned} \]
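6. The first m − 1 planes decompose the mth into f(2, m − 1) plane regions lying
respectively in f(2, m − 1) of the f(3, m − 1) solid regions. These f(2, m − 1) solid
regions are each decomposed into two, whereas the rest are unaffected. Hence

\[ \begin{aligned}
f(3, m) &= f(3, m - 1) + f(2, m - 1) \\
        &= f(3, m - 2) + f(2, m - 2) + f(2, m - 1) \\
        &\;\;\vdots \\
        &= f(3, 0) + f(2, 0) + f(2, 1) + \cdots + f(2, m - 1) \\
        &= 1 + \sum_{r=0}^{m-1} \left\{ 1 + \binom{r+1}{2} \right\} = 1 + m + \binom{m+1}{3} \\
        &= \binom{m}{0} + \binom{m}{1} + \binom{m}{2} + \binom{m}{3}.
\end{aligned} \]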


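7. f(n, m) = f(n, m − 1) + f(n − 1, m − 1), with f(n, 0) = 1. Therefore

\[ f(n, m) = f(n, 0) + \sum_{r=0}^{m-1} f(n - 1, r). \]

To prove

\[ f(n, m) = \binom{m}{0} + \binom{m}{1} + \cdots + \binom{m}{n} \]

by induction, assume the same formula with n replaced by n − 1. Then

\[ \begin{aligned}
f(n, m) &= 1 + \sum_{r=0}^{m-1} \left\{ \binom{r}{0} + \binom{r}{1} + \cdots + \binom{r}{n-1} \right\} \\
        &= 1 + \binom{m}{1} + \binom{m}{2} + \cdots + \binom{m}{n} \\
        &= \binom{m}{0} + \binom{m}{1} + \cdots + \binom{m}{n}.
\end{aligned} \]

The final step makes use of the familiar series

\[ \sum_{r=n-1}^{m-1} \binom{r}{n-1} = \sum_{r=n-1}^{m-1} \left\{ \binom{r+1}{n} - \binom{r}{n} \right\} = \binom{m}{n}. \]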
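The recurrence of ex. 7 and its closed form as a sum of binomial coefficients can be compared directly for small values. The sketch below is not from the text; in particular the base case f(0, m) = 1 (a single point is one region, however many hyperplanes miss it) is an added assumption that makes the recurrence self-starting.

```python
from functools import lru_cache
from math import comb

@lru_cache(maxsize=None)
def f(n, m):
    """Number of regions into which m hyperplanes in general position divide n-space."""
    if m == 0 or n == 0:
        # f(n, 0) = 1 is stated in ex. 7; f(0, m) = 1 is an assumed base case.
        return 1
    return f(n, m - 1) + f(n - 1, m - 1)

def closed_form(n, m):
    """The sum C(m,0) + C(m,1) + ... + C(m,n) proved by induction in ex. 7."""
    return sum(comb(m, k) for k in range(n + 1))

agree = all(f(n, m) == closed_form(n, m)
            for n in range(6) for m in range(12))
```

For instance, three lines in general position give f(2, 3) = 7 plane regions, and four planes give f(3, 4) = 15 solid regions.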
§12.6 


1. The relation [prs] tells us that the three lines do not meet one another and that
they contain points A, C, B (respectively) such that [ACB]. Suppose that p₁ is parallel
to s. Then any ray from A within the angle between AB and p₁ meets s, and therefore
also r. Hence p₁ is parallel to r.

2. The two rays through a given point parallel to a given line appear as the segments 
joining the point to the ends of the chord. 




§13.1 


1. Let q and r be the two parallel lines. If p met q without meeting r, then p and q
would be two lines through the point p·q, both parallel to r, contradicting Axiom 13.11.
2. Yes, provided the three lines are coplanar. 


§13.2 


1. This is a one-dimensional version of the principle that every direct isometry 
(including the identity) is the product of an even number of opposite isometries. 
Translations preserve directions, whereas half-turns reverse directions. 

2. By 13.25, (A → D) = (C → B) implies (A ↔ B) = (C ↔ D), that is, (A ↔ B) =
(D ↔ C), which, in turn, implies (A → C) = (D → B).

3. This is the half-turn about C. 

4. Any two opposite sides are interchanged by the half-turn about the common 
midpoint of the diagonals. 

5. Using the symbol = to relate congruent segments, we have 


BA, = CB, = A.C, BA, = CyB, = ASC. 
By ex. 2, BA, = A,C implies BA, = A,C. Hence As = Aj. 


6. Dissect the quadrangle into two triangles by a diagonal, and use 13.26. 
7. The three bimedians all have the same midpoint. 


§13.3 


1. Each point on AB is transformed into a point dividing the segment AB in the
same ratio, that is, into itself. For any other point P, we can draw PM parallel to
CA, and PN parallel to CB, with M and N on AB. The corresponding point P′ is
obtained by drawing MP′ parallel to AC′, and NP′ parallel to BC′.

This is a typical affine construction. Instead of Euclid’s “ruler and compasses”
we are using a simpler instrument, the parallel-ruler, which consists of four rulers (or
two rulers and two auxiliary bars) forming a parallelogram with pivots at the four
vertices. This enables us to draw lines parallel to a given line (and, of course, also to
join two given points).

2. If an affinity interchanges two points C and C′, it leaves invariant the midpoint
O of CC′. If O is the only invariant point, it is also the midpoint of PP′ for any P,
and the affinity is the half-turn C ↔ C′. But if there is another invariant point M,
the affinity, transforming the triangle MCC′ into MC′C, is the reflection M(CC′).

3. If the affinity is not a dilatation, it must transform at least one line a into a line a′
not parallel to a. Another line b, parallel to a, will be transformed into another line
b′, parallel to a′. The point of intersection A = a·a′ is invariant; for, if not, it would
lie on an invariant line m and could be called either a·m or a′·m, contradicting its
noninvariance. Similarly, there is another invariant point B = b·b′. Therefore, the
affinity is either a shear or a strain: ABC → ABC′.

4. Since lines have linear equations, any transformation of the form 13.33 preserves 
collinearity. Conversely, we can express any affinity /Y Y -- /'X’Y’ in the form 13.33 
with suitable values for a, b, c, d, l, m. For, the triangle (0, 0)(1, 0)(0, 1) is
transformed into

(l, m)(a + l, c + m)(b + l, d + m),

which can be identified with any given triangle. The non-collinearity of these three
points is ensured by the condition ad ≠ bc.


5. (i) Translation, (ii) Central dilatation, 
(iii) Shear, (iv) Strain 
(including an affine reflection as the special case when a = —1). 
§13.4 


1. This follows from the remark after 13.41. 
2. If the diagonals P₀P₃, P₁P₄, P₂P₀, P₃P₁ are parallel to the sides P₁P₂, P₂P₃, P₃P₄,
P₄P₀, respectively, the following triangles all have the same area:

P₀P₁P₂, P₁P₂P₃, P₂P₃P₄, P₃P₄P₀, P₄P₀P₁.

Therefore, P₄P₂ is parallel to P₀P₁.

3. When it is a translation or a half-turn. In fact, a central dilatation O(λ) has
ad − bc = λ².

4. Always. 

5. Never. 

6. Each affine reflection reverses area. 

7. A translation is the product of two affine reflections in the direction of the
translation, the mirrors being parallel lines in any other direction. More precisely,
in the notation of Figure 13.2d, the translation A → D is the product of reflections
A(BC) and B(AD).

A half-turn is the product of reflections in two intersecting lines, the direction of
each reflection being along the mirror for the other: the half-turn A ↔ B is the product
of A(CD) and C(AB).

A shear is the product of reflections in one mirror in two different directions. 
Alternatively, it is the product of reflections in one direction in two intersecting mirrors. 
8. For any geometric transformation, the successive transforms of a noninvariant
point P₀ comprise a set of points P₀P₁P₂ ··· called the orbit of P₀; the transformation
takes P₀ to P₁, P₁ to P₂, and so on. Exercise 3 (on page 203), describing a situation in
which the orbit of every point consists of a set of collinear points, shows that the only
affinities of this kind are the “trivial” ones: the dilatations, shears and strains. For
every other kind of affinity there is at least one noninvariant point P₀ lying on no
invariant line; the line P₀P₁ is transformed into a different line P₁P₂, the orbit begins
with three points forming a triangle, and the affinity can be expressed as P₀P₁P₂ →
P₁P₂P₃. In the case of an equiaffinity, the “trivial” kinds are those considered in
ex. 7. For any other kind, P₀P₁P₂ and P₁P₂P₃ are two triangles of equal area. Since
they have a common side P₁P₂, ex. 1 shows that P₀P₃ must be parallel to P₁P₂.

9. Since the translation, half-turn and shear have already been covered in ex. 7, we
may restrict consideration to P₀P₁P₂ → P₁P₂P₃ with P₀P₃ parallel to P₁P₂. Letting M
denote the midpoint of P₀P₃, as in Figure 13.4b, we see that this equiaffinity is the
product of the two affine reflections

R₁ = M(P₁P₂) and R₂ = P₂(P₁P₃)

(compare §2.7 on page 34). For, these reflections have the effect

P₀P₁P₂ → P₃P₂P₁ → P₁P₂P₃.

10. Since R₁ transforms the points ··· P₀P₁P₂P₃ ··· into ··· P₃P₂P₁P₀ ···, while
R₂ transforms ··· P₀P₁P₂P₃P₄ ··· into ··· P₄P₃P₂P₁P₀ ···, the stated parallelism
certainly occurs when i + j = h + k = 3 or 4. For other values, we simply have to
transform by a suitable power of the equiaffinity R₁R₂.

11. Because affine geometry cannot distinguish between a circle and any other ellipse.
The elliptic shadow cast by a coin illustrates the fact that an ellipse is a “strained
circle.” We are free to use a strain as a coordinate transformation, writing εx for x,
so that the ellipse becomes

\[ \varepsilon^2 x^2 + y^2 = 1 \]

and the elliptic rotation 13.49 becomes

\[ x' = x\cos\theta - \frac{y}{\varepsilon}\sin\theta, \qquad y' = \varepsilon x\sin\theta + y\cos\theta. \]

This equiaffinity reduces to a half-turn when we set θ = π. If instead we set θ = π − ε,
ε = π/(2d + 1), and make d tend to infinity, we obtain a new equiaffinity: the focal
rotation

\[ x' = -x - y, \qquad y' = -y, \]

which leaves invariant the pair of parallel lines y² = 1. In other words, the affinely
regular star polygons of type {2 + 1/d} (d = 2, 3, 4, . . .) may be regarded as approximating
either a digon (page 37) or a focal polygon, whose vertices

(0, 1), (−1, −1), (2, 1), (−3, −1), . . .

lie alternately on these two parallel lines while its sides pass alternately through the
two “foci” (±½, 0).

12. Every triangle is affinely regular, but the only affinely regular quadrangles are 
the parallelograms. 

13. Given P₀, P₁, P₂, complete the parallelogram P₀P₁P₂O. Then draw P₂P₃ parallel
to P₁O (with P₃ on O/P₀), P₃P₄ parallel to P₂O (with P₄ on O/P₁), and P₄P₅ parallel
to OP₃ (with P₅ on O/P₂).

14. It is important to remember that affine geometry admits no measure of angles.
The symbol θ occurring in 13.49 must not be interpreted as an angle but simply as a
number. The use of sines and cosines does not force us to work in Euclidean geometry:
they are employed because of their convenient properties, such as cos² θ + sin² θ = 1.
These functions can, of course, be defined by analytical means without any reference
to geometry. After these words of caution, we take the typical vertex Pⱼ to have
affine coordinates

(cos jθ, sin jθ),

where θ = 2π/n, and conclude that

\[ \frac{P_0P_3}{P_1P_2} = \frac{\sin 3\theta}{\sin 2\theta - \sin\theta} = \frac{3 - 4\sin^2\theta}{2\cos\theta - 1} = 2\cos\theta + 1. \]

15. The values are 2, 3, 4, 6 (as in §4.5 on page 60). This conclusion can be justified
as follows. We see from §13.3 that the parallel-ruler enables us to multiply the length
of a given segment by any rational number. In fact, given (0, 0), (1, 0), (0, 1), the
points that can be constructed by means of the parallel-ruler are the points (x, y) whose
affine coordinates are rational, and no others. We see from ex. 8 that the nature of an
affinely regular polygon P₀P₁P₂ ··· is determined by the position of P₃ on the line through
P₀ parallel to P₁P₂. It is clear from ex. 10 that we can then construct P₄ and all the
other vertices in turn. Exercise 14 shows that, for a polygon of type {n}, P₃ can be
constructed if and only if cos θ and θ/π are both rational.

The following trick for determining the admissible values of θ was devised by the
same H. W. Richmond who geometrized Gauss’s solution of the cyclotomic equation
z¹⁷ − 1 = 0 (Figure 2.15 on page 27).

Since cos 2θ = 2 cos² θ − 1, every rational cos θ yields a rational cos 2θ. Since
θ/π is rational, the expressions

cos θ, cos 2θ, cos 4θ, . . . , cos 2ᵏθ, . . .

comprise a finite set of rational numbers. When these rational numbers are
expressed as fractions in their “lowest terms,” let b be the greatest denominator that
occurs, and let

\[ \cos\varphi = \frac{a}{b} \qquad (\varphi = 2^k\theta) \]

be one of the numbers having this denominator. Since a and b are relatively prime,
the denominator of

\[ \cos 2\varphi = \frac{2a^2 - b^2}{b^2} \]

is either b² or (if b is even) ½b². But this denominator must not be greater than b.
Therefore,

b ≥ ½b² > 0,  b ≤ 2,  b = 1 or 2,  cos φ = 0 or ±1 or ±½,

and the only admissible values for φ are jπ/2 and jπ/3 for integers j. Since cos (π/4)
and cos (π/6) are irrational, it follows that the only admissible values for θ, with
0 < θ ≤ π, are

π/3, π/2, 2π/3, π.

In our geometric application of this result, θ = 2π/n ≤ π. Hence n = 2, 3, 4 or 6.
In other words, the only finite affinely regular polygons constructible with the
parallel-ruler are the digon, triangle, parallelogram, and affinely regular hexagon. (For
instance, the pentagram and pentagon shown in Figure 13.4e are not constructible. It
is impossible to assign rational coordinates to all the five vertices simultaneously.)

The connection with §4.5 may be explained by observing that, in the formation of a
crystal, Nature is, in effect, using a parallel-ruler to line up certain atoms in the
straight rows of a lattice.
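The key step of Richmond's argument in ex. 15, that doubling the angle turns a denominator b into b² or ½b², can be tested with exact rational arithmetic. The following sketch is illustrative only; the function names and the search bound are assumptions, not part of the text.

```python
from fractions import Fraction
from math import gcd

def doubled(q):
    """cos 2φ expressed in terms of q = cos φ."""
    return 2 * q * q - 1

def expected_denominator(b):
    """b*b, or half of it when b is even, as claimed in the argument."""
    return b * b // 2 if b % 2 == 0 else b * b

def rule_holds(max_b):
    """Check the claim for every fraction a/b in lowest terms with |a/b| <= 1, b <= max_b."""
    for b in range(1, max_b + 1):
        for a in range(-b, b + 1):
            if gcd(a, b) == 1 and doubled(Fraction(a, b)).denominator != expected_denominator(b):
                return False
    return True

def non_growing(max_b):
    """Values a/b whose doubled cosine has a denominator no greater than b."""
    return sorted({Fraction(a, b)
                   for b in range(1, max_b + 1)
                   for a in range(-b, b + 1)
                   if gcd(a, b) == 1 and doubled(Fraction(a, b)).denominator <= b})
```

Within the range searched, `non_growing` returns only −1, −½, 0, ½, 1, which are the values cos φ = 0, ±1, ±½ singled out in the answer.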


§13.5 

1. Any common factor of « and «, would divide xy, — yw,. By 13.52, this is 
impossible. 

2. By 13.52, x₀y − y₀x = 1 = xy₁ − yx₁ and therefore

\[ (x_0 + x_1)y = (y_0 + y_1)x. \]


3. We can systematically assign the symbols 0, 1,..., 6 in cyclic order to the 
points of the basic lattice, as follows: each of A, B, C gets the label 0, and then we 
proceed with 1,2, 3,4, 5, 6, 0, 1,... to the right; from A toward L we have 0, 3, 6, 
2,5, 1,4, 0,...; and from B toward M we have 0, 5, 3, 1, 6,4, 2, 0,.... All the 
points numbered alike form a “sublattice,” and since there are seven such sublattices, 
each has a unit cell seven times as big as that of the basic lattice. 

Alternatively, let the basic lattice consist of the points whose affine coordinates are 
integers. Take B at (0, 0), C at (2, 1), A at (−1, 3). Then the only lattice points inside
the triangle ABC are (1, 1), (0, 2), (0, 1), forming a triangle of area ½ (that is, half a
unit cell). By Pick’s theorem, the area of ABC is 3/2 + 3 − 1 = 7/2.

4. (a) The triangle (0, 0) (3, 1) (−1, 4) has area 3/2 + 6 − 1 = 13/2, whereas the
Cevians form an inner triangle of area 3/2 + 0 − 1 = ½. The ratio is 1/13. (b) The
triangle (0, 0) (3, 2) (−2, 5) has area 3/2 + 9 − 1 = 19/2, whereas the Cevians form an
inner triangle of area 3. 

5. The parallelogram (0, 0) (2, −1) (3, 1) (1, 2) has area 4/2 + 4 − 1 = 5, whereas
the small parallelogram in the middle has area 1.

The parallelogram (±6, ±6) has area 144, whereas the small octagon in the middle,
namely (±3, 0) (±2, ±2) (0, ±3), with 21 interior points, has area 24.

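6.
\[ 1 - \frac{1}{\lambda+1}\cdot\frac{\mu}{\mu+1} - \frac{1}{\mu+1}\cdot\frac{\nu}{\nu+1} - \frac{1}{\nu+1}\cdot\frac{\lambda}{\lambda+1}
 = 1 - \frac{\mu(\nu+1) + \nu(\lambda+1) + \lambda(\mu+1)}{(\lambda+1)(\mu+1)(\nu+1)}
 = \frac{\lambda\mu\nu + 1}{(\lambda+1)(\mu+1)(\nu+1)}. \]

7. This is obvious unless λ, μ, ν are either all ≥ 1 or all ≤ 1. Assuming one of these
eventualities, suppose, if possible, that LMN is the smallest of the four triangles. Then
λμν + 1 must be less than or equal to each of

(λ + 1)ν, (μ + 1)λ, (ν + 1)μ.

By addition,

\[ 3(\lambda\mu\nu + 1) \le (\lambda+1)\nu + (\mu+1)\lambda + (\nu+1)\mu = (\mu\nu + \lambda) + (\nu\lambda + \mu) + (\lambda\mu + \nu), \]

that is,

\[ (\lambda - 1)(\mu\nu - 1) + (\mu - 1)(\nu\lambda - 1) + (\nu - 1)(\lambda\mu - 1) \le 0. \]

Since λ, μ, ν are all ≥ 1 or all ≤ 1, each of these three terms must be zero, so that
λ = μ = ν = 1.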
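The lattice areas quoted in the answers to §13.5 rest on Pick's theorem: area = ½b + i − 1, where b and i count boundary and interior lattice points. The sketch below compares that formula with the shoelace formula and a brute-force interior count for three of the polygons mentioned; the function names are illustrative, and the interior test assumes a convex polygon with vertices listed in order.

```python
from math import gcd

def edges(poly):
    """Consecutive vertex pairs of a closed polygon."""
    return list(zip(poly, poly[1:] + poly[:1]))

def twice_area(poly):
    """Twice the area by the shoelace formula (an integer for lattice polygons)."""
    return abs(sum(x1 * y2 - x2 * y1 for (x1, y1), (x2, y2) in edges(poly)))

def boundary_count(poly):
    """Lattice points on the boundary: gcd(|dx|, |dy|) for each edge."""
    return sum(gcd(abs(x2 - x1), abs(y2 - y1)) for (x1, y1), (x2, y2) in edges(poly))

def interior_count(poly):
    """Brute-force count of lattice points strictly inside a convex polygon."""
    xs = [x for x, _ in poly]
    ys = [y for _, y in poly]
    count = 0
    for x in range(min(xs), max(xs) + 1):
        for y in range(min(ys), max(ys) + 1):
            signs = [(x2 - x1) * (y - y1) - (y2 - y1) * (x - x1)
                     for (x1, y1), (x2, y2) in edges(poly)]
            if all(s > 0 for s in signs) or all(s < 0 for s in signs):
                count += 1
    return count

def pick_holds(poly):
    """Pick's theorem in integer form: 2 * area == b + 2 * i - 2."""
    return twice_area(poly) == boundary_count(poly) + 2 * interior_count(poly) - 2

triangle = [(0, 0), (2, 1), (-1, 3)]
parallelogram = [(0, 0), (2, -1), (3, 1), (1, 2)]
octagon = [(3, 0), (2, 2), (0, 3), (-2, 2), (-3, 0), (-2, -2), (0, -3), (2, -2)]
```

For the octagon this gives 21 interior points and twice the area equal to 48, matching the area 24 stated in ex. 5.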
§13.6 

1. (i), (ii), and (iv) lack the additive identity (zero); (iii) contains 1 and i but not
1 + i. The remaining four sets include zero, which has no multiplicative inverse.

2. If B is the centroid of masses a at A and c at C, B′ is the centroid of masses a at
A′ and c at C′. Points dividing AA′, BB′, CC′ in the ratio μ:1 are the centroids of
masses 1 at A and μ at A′, 1 at B and μ at B′, 1 at C and μ at C′. Of these three points,
the middle one is the centroid of masses a at the first and c at the last.

In the concise notation of Möbius’ barycentric calculus, we have

B = aA + cC,  B′ = aA′ + cC′,
B + μB′ = a(A + μA′) + c(C + μC′).

3. In Möbius’ notation, the centroid of equal masses at the four vertices of the
quadrangle ABCD is A + B + C + D. The vertices of the Varignon parallelogram
are A + B, B + C, C + D, D + A, and its center is (A + B) + (C + D) =
(B + C) + (D + A).

4. Cutting the quadrangle along either diagonal, we obtain two triangles whose 
centroids are the midpoints of two of the broken line segments in Figure 13.6d. 

5. A centrally symmetrical quadrangle, that is, a parallelogram. 


§13.7 
1. Inside the triangle A₁A₂A₃ we have + + +; beyond the side A₂A₃, − + +;
and beyond the vertex A₁, + − −.


2. $\begin{vmatrix} 1 & 0 & 1 \\ 0 & 1 & 1 \\ x & y & 1 \end{vmatrix}$


> —> — — 
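3. \[ \tfrac12\left(\overrightarrow{OS} + \overrightarrow{OT}\right) = \tfrac12\left(\sum s_i\,\overrightarrow{OA_i} + \sum t_i\,\overrightarrow{OA_i}\right) = \sum \tfrac12(s_i + t_i)\,\overrightarrow{OA_i}. \]

4. \[ \sigma\sum s_i\,\overrightarrow{OA_i} + \tau\sum t_i\,\overrightarrow{OA_i} = \sum(\sigma s_i + \tau t_i)\,\overrightarrow{OA_i}. \]

5. In this formulation it is no longer necessary to assume Σsᵢ = Σtᵢ.

6. \[ \begin{vmatrix} 0 & 1 & \lambda \\ \mu & 0 & 1 \\ 1 & \nu & 0 \end{vmatrix} = \lambda\mu\nu + 1. \]
This has to be divided by (λ + 1)(μ + 1)(ν + 1). When L, M, N are collinear, it
becomes zero, in agreement with Menelaus’s theorem.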
7. When the line is entirely outside the triangle, the signs are all alike (say all plus). 
When the line penetrates the sides 4,4, and 4,4,, Ts differs in sign from T, and Ty. 


§13.8 
1. Any common point of a and b would be a common point of a and α. Apply
13.82 to b, c, and a.
2. Since a is parallel to b, it is parallel to the plane α through b, and we can use
ex. 1.
3. Our proof of 13.81 shows that all the lines through A in the plane q′r′ are parallel
to lines in the plane qr.

4. The centroid of equal masses at A₁, A₂, A₃, A₄ is the centroid of masses 1 at
A₁, 1 at A₂, and 2 at the midpoint of A₃A₄.

5. (t₁, t₂, t₃, t₄) is the centroid of masses tᵢ at Aᵢ (i = 1, 2, 3, 4).


§13.9 
1. In the notation of Figure 13.8b, if A′, B′, C′, O′ are the vertices opposite to
A, B, C, O, the six sides of the skew hexagon AB′CA′BC′ can be joined to the diagonal
OO′ to form a cycle of six tetrahedra, consecutive pairs of which are related by affine
reflections; e.g., the tetrahedra OO′AB′ and OO′B′C are related by the affine reflection
OO′(AC), which leaves invariant the plane OO′B′ while interchanging A and C.

2. An affine reflection interchanges pairs of points, P and P′, in such a way that all
the joining lines PP′ are parallel and all the segments PP′ are bisected by the mirror
(which, in the three-dimensional case, is a plane).

3. The points (a, b, c) and (a′, b′, c′) are interchanged by the central inversion
(x, y, z) → (a + a′ − x, b + b′ − y, c + c′ − z).

4. If (x, y, z) is a lattice point lying in a first rational plane Xx + Yy + Zz = ±1,
any common divisor of x, y, z would have to divide ±1.

5. No. For instance, (1, 1, 0) is a visible point in the “second” rational plane
x + y = 2.

6. When x = 1, we have 2y + 3z = −1. Two obvious solutions are y = 1, z = −1,
and y = −2, z = 1. When x = −4, we have 2y + 3z = 5, with the obvious solution
y = z = 1. We thus obtain the triangle (1, 1, −1)(1, −2, 1)(−4, 1, 1).

7. The triangle (1, 1, −1)(1, −2, 1)(−4, 1, 1), whose determinant is −1, is half a
unit cell for the lattice in the plane 6x + 10y + 15z = 1. Hence the general lattice
point in this plane is

(1, 1, −1) + m(0, −3, 2) + n(−5, 0, 2) = (1 − 5n, 1 − 3m, −1 + 2m + 2n),

where m and n run through all the integers.

8. The given equation implies x² + 2√2 xy + 2y² = 3z². Since √2 is irrational,
any solution in integers would require xy = 0 and x² + 2y² = 3z², which is impossible
by the usual argument for establishing the irrationality of √3.
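The parametrization in ex. 7 of the lattice points in the plane 6x + 10y + 15z = 1 can be checked by brute force. This is a sketch under the assumption that the plane and the generating triangle are as written here; the helper names and the search ranges are illustrative.

```python
from math import gcd

def plane(x, y, z):
    """Left-hand side of the plane equation 6x + 10y + 15z = 1."""
    return 6 * x + 10 * y + 15 * z

def lattice_point(m, n):
    """(1, 1, -1) + m(0, -3, 2) + n(-5, 0, 2)."""
    return (1 - 5 * n, 1 - 3 * m, -1 + 2 * m + 2 * n)

def det3(a, b, c):
    """Determinant of the 3x3 matrix with rows a, b, c."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - a[1] * (b[0] * c[2] - b[2] * c[0])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))

span = range(-6, 7)
generated = {lattice_point(m, n) for m in span for n in span}

# Every integer solution inside a small box, found by exhaustive search.
box = range(-10, 11)
solutions = {(x, y, z) for x in box for y in box for z in box
             if plane(x, y, z) == 1}

all_on_plane = all(plane(*p) == 1 for p in generated)
all_visible = all(gcd(gcd(x, y), z) == 1 for x, y, z in generated)
triangle_det = det3((1, 1, -1), (1, -2, 1), (-4, 1, 1))
```

Every solution found in the box appears among the generated points, and each generated point has coprime coordinates, as ex. 4 requires for a first rational plane.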


§14.1 


1. By 14.11 the four points described in 14.13 are joined in pairs by six lines which,
by 14.12, meet any other line in at least three points. Also each of the six lines meets
the others in three points.

2. The m points, with c lines through each, apparently make a total of cm lines;
but in this estimate each of the n lines is counted d times, once for each of the d points
on it. Therefore cm = dn. The Pappus configuration 9₃ may be regarded as a cycle
of three “Graves triangles” in six ways. (Coxeter, Projective Geometry, 1964, p. 39.)

3. Here is the table: 


12 11 10  9  8  7  6  5  4  3  2  1  0
 1  2  3  4  5  6  7  8  9 10 11 12  0
 2  3  4  5  6  7  8  9 10 11 12  0  1
 4  5  6  7  8  9 10 11 12  0  1  2  3
10 11 12  0  1  2  3  4  5  6  7  8  9

The last column indicates that the points on p₀ are P₀, P₁, P₃, P₉, and that the lines
through P₀ are p₀, p₁, p₃, p₉. The other columns have an analogous interpretation.
The columns of the table

P₀   P₁₀  P₉   P₆   P₅   P₂   P₈   P₃
P₁₀  P₉   P₆   P₅   P₂   P₈   P₃   P₀
P₆   P₅   P₂   P₈   P₃   P₀   P₁₀  P₉

indicate the eight lines of the configuration 8₃ formed by the cycle of eight points
P₀P₁₀P₉P₆P₅P₂P₈P₃. The two mutually inscribed quadrangles are obtained by taking
alternate points of this cycle.
4. Through any one of the points we have p + 1 lines, each containing p further
points. This makes 1 + (p + 1)p points altogether.
5. The whole finite geometry provides a counterexample to refute Sylvester’s
theorem. Every line joining two of the points belongs to the geometry and thus
contains not only two but p + 1 of the points.
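The incidence table of ex. 3 can be regenerated from a single rule. The sketch below assumes that P_s lies on p_r exactly when r + s is congruent to 0, 1, 3 or 9 modulo 13 (this is what the last column of the table suggests), and it reads the cycle of eight points as P₀P₁₀P₉P₆P₅P₂P₈P₃, which is an assumption about the subscripts. All function names are illustrative.

```python
N = 13
D = (0, 1, 3, 9)  # assumed rule: P_s lies on p_r when (r + s) mod 13 is in D

def points_on(r):
    """Indices s of the points P_s on the line p_r."""
    return sorted((d - r) % N for d in D)

def lines_through(s):
    """Indices r of the lines p_r through the point P_s."""
    return sorted((d - s) % N for d in D)

def joins(s, t):
    """All lines containing both P_s and P_t."""
    return [r for r in range(N) if s in points_on(r) and t in points_on(r)]

def collinear(triple):
    """True when some line p_r contains all three points of the triple."""
    return any(all(s in points_on(r) for s in triple) for r in range(N))

# Any two distinct points should lie on exactly one line.
unique_joins = all(len(joins(s, t)) == 1
                   for s in range(N) for t in range(s + 1, N))

# Triples (c[i], c[i+1], c[i+3]) taken around the assumed cycle of eight points.
cycle = [0, 10, 9, 6, 5, 2, 8, 3]
triples = [(cycle[i], cycle[(i + 1) % 8], cycle[(i + 3) % 8]) for i in range(8)]
eight_lines = all(collinear(t) for t in triples)
```

With this rule the eight triples are all collinear, so they serve as the eight lines of a configuration 8₃ on the eight points of the cycle.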


§14.2 
1. Let A, B, C, D be the points 14.23, and continue as follows:
E = AD·BC = (0, 1, 1),  F = BD·CA = (1, 0, 1),
G = AB·EF = (−1, 1, 0),  H = BC·DG = (0, 2, 1),
I = AD·FH = (1, 2, 2),  J = EF·CI = (1, 2, 3).


2. The three pairs of opposite sides are
x₂ ± x₃ = 0,  x₃ ± x₁ = 0,  x₁ ± x₂ = 0.

3. We see that P; = PoP3-PyP;, P,. = P,P;: PsP, The collinearity of PoP;P, 
makes x? +” +4+1=0, 
4. Py = PoP, P,P. = (0,1, 1), Py, = P,P5° PoP, = (1, 0, 1), 

P, = P,P;- PoP, = (1, 1, 0), Py = P\P;° PsP, = (1, 2, 1) 

P, = PoP5 + PsP, (2,1, 1), Py = PoP, PP = (1, 2, 0), 

Py = P,Po° PoP, = (0, 1, 2), Py, = PoP;* PoP, = U1, 1, 2), 

Pyp = PyP2: PP; = (2, 0, 1). 


The lines are as follows: 


© 


Poit, = 0. Pii%2 = 9. P2i% + #3 = 0. 
Psit. +% = 0. Par®, + ty + %y = 0. p5:®, + %y — %y = 0, 
Poi —% + Ug + %y =O. pet, + % = 0~ Pit, — % = 0. 
Poit, — X= 0, Proi% — %y + tg = 0. Prt, — % = 0. 
Prizi®, = 0. 


5. The points PyP,P,P3P,P;P, may be taken to be 
,0,0) (,1,0) (0,0,1) (,1,0) (,1,1) G,1,1)) G,90, 1. 
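Joins and intersections in homogeneous coordinates, as used in ex. 1 of this section, can be computed with vector products: the line joining two points and the point common to two lines are both given by a cross product. The sketch below assumes that the points 14.23 are A = (1, 0, 0), B = (0, 1, 0), C = (0, 0, 1), D = (1, 1, 1), and that the last point of ex. 1 is J = EF·CI; both are assumptions, and the function names are illustrative.

```python
def cross(u, v):
    """Vector product; gives the join of two points or the meet of two lines."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def meet(p, q, r, s):
    """Point of intersection PQ·RS of the lines PQ and RS."""
    return cross(cross(p, q), cross(r, s))

def same_point(u, v):
    """Homogeneous coordinates agree up to a nonzero factor."""
    return any(u) and any(v) and cross(u, v) == (0, 0, 0)

A, B, C, D = (1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, 1)  # assumed points 14.23
E = meet(A, D, B, C)
F = meet(B, D, C, A)
G = meet(A, B, E, F)
H = meet(B, C, D, G)
I = meet(A, D, F, H)
J = meet(E, F, C, I)
```

Since homogeneous coordinates are only determined up to a factor, the results are compared with `same_point` rather than by equality.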
§14.3 
1. The points (1, 0, 0), (1, 1, 1), (p, 1, 1) all lie on the line x₂ = x₃. We obtain
(0, q − 1, 1 − r) by subtracting (1, 1, r) from (1, q, 1).
3. S=Pi,, T=P;, U = V = F = Py, (The point Pg is not used.) 


§14.4 
1. The definition for a harmonic set involves A and B symmetrically, also C and F 
in the same way. 
2. Draw any triangle RSP whose sides SP, PR, RS pass respectively through
A, B, C. Let AR meet BS in Q. Then PQ meets AB in the desired harmonic conjugate.

3. Taking RAB as triangle of reference, let C and S be (0, 1, λ) and (1, 1, λ). Then
Q is (1, 1, 0), P is (1, 0, λ), and F is (1, 1, 0) − (1, 0, λ) = (0, 1, −λ).

4. On any line in PG(2, 3), there are exactly four points. The harmonic conjugate
of any one of these with respect to any two others is a fourth point on the line and
therefore can only be the fourth point on the line!

5. The same harmonic set is determined projectively by the quadrangle, and affinely
by dividing the segment AA′ internally at A₁ and externally at A₂, in the same ratio.


§14.5 


1. If x and x′ are corresponding lines of two perspective pencils, their point of
intersection x·x′ continually lies on the axis o.

2. Any section of a harmonic set of lines is a harmonic set of points, and any har- 
monic set of points is projected by a harmonic set of lines. 

5. Whenever a projectivity on a line g is the product of two perspectivities, the 
join of the two centers meets g in an invariant point. 

7. If the given projectivity is an involution, say (AA′)(BB′), it is expressible as the
product of the two involutions (AB)(A′B′) and (AB′)(BA′). If the given projectivity
is not an involution, and A is any noninvariant point, the projectivity may be expressed
as AA′A″ ⊼ A′A″A‴ (where possibly A‴ coincides with A); it is then seen to be the
product of the two involutions

(AA″)(A′A′) and (AA‴)(A′A″).


8. (i) (c₁₁ − c₂₂)² + 4c₁₂c₂₁ = 0.
(ii) c₁₁ + c₂₂ = 0.


§14.6 


1. Let AA′ be the given pair of corresponding points, collinear with the center O.
Let AX meet the axis in C. Then the collineation takes AC to A′C and leaves invariant
the line OX. Therefore X′ is the point of intersection of A′C and OX.

2. In the notation of Figure 14.3a, consider the perspective collineation with center
O and axis DE that transforms P into P′. When the construction in ex. 1 is applied
to Q it yields Q′, and when it is applied to R it yields R′.

3. Let two points A and X, outside the line o of invariant points, be transformed
into A′ and X′. Since AA′ and XX′ are invariant lines, their common point O is an
invariant point and therefore lies on o. Hence all joins of pairs of corresponding
points meet o in the same point O.

4. In the notation of ex. 3, let O₁ be the harmonic conjugate of O with respect to
A and A′. Then the harmonic homologies with centers A and O₁ will have the desired
effect, since the former leaves A invariant and the latter takes A to A′.

5. Yes. For the unique projectivity P)P,Ps; A P,P,P, must transform the remaining 
point on P,P, into the remaining point on P,P, It is not necessary to give actual 
perspectivities, but in case they are desired, one possibility is 

PoP PPy = PyPsPpPig = PPP Pia 
P;, > Pg; is a projective collineation of period 3. 
6. (i) A homology with center (0, 0, 1) and axis x₃ = 0.
(ii) An elation with center (c₁, c₂, 0) and axis x₃ = 0.
7. Consider a quadrilateral APXA₁P₁X₁, as in Figure 14.6b, with A conjugate to
A₁ and P to P₁. The polars a and p pass through A₁ and P₁, respectively. By 14.64,
the polar triangles APX and apx are perspective from the line A₁P₁. Therefore x
passes through X₁, and X is conjugate to X₁.

8. The condition for the two points (0, 1, ±1) to be conjugate is c₂₂ − c₃₃ = 0; for
(±1, 0, 1), c₃₃ − c₁₁ = 0. These two conditions imply c₁₁ − c₂₂ = 0, which is the
conjugacy condition for (1, ±1, 0).

9. The given bilinear relation makes x₁ = 0 the polar of (1, 0, 0), and
x₁ + x₂ + x₃ = 0 the polar of (1, 1, 1). Any self-conjugate point (x) must satisfy
x₁² + x₂² + x₃² = 0. This is impossible in the real field but happens for all the four
points (1, ±1, ±1) in PG(2, 3).


§14.7 


2. When B = D, we have x = p, y = PQ = d, and x·y = P.
When A = D, we have y = q, x = d, and x·y = Q.

3. See Coxeter 2, pp. 88-89. 

7. The hint shows that the correlation Pᵢ → pᵢ is projective. Being obviously of
period 2, it is a polarity. The triangle P₄P₁₀P₁₂, whose sides are p₄, p₁₀, p₁₂, is self-polar.
Finally, since the residues 0, 7, 8, 11 are the halves (mod 13) of 0, 1, 3, 9, the points
P₀, P₇, P₈, P₁₁ (and no others) lie on their polars.

Thus the four lines p₀, p₇, p₈, p₁₁ are tangents, the six lines p₁, p₂, p₃, p₅, p₆, p₉ are
secants, and the three lines p₄, p₁₀, p₁₂ are non-secants. The three non-secants are the
sides of the self-polar triangle P₄P₁₀P₁₂ which was used in describing the polarity.
Since each non-secant is a common side of two self-polar triangles, there are three
further self-polar triangles

P₄P₅P₉, P₁₀P₃P₆, P₁₂P₁P₂,

each having for its sides one non-secant and two secants [like the triangle EHH of
Coxeter 2, p. 82, Fig. 6.2C]. Each secant, containing only one pair of distinct conjugate
points, is a side of only one self-polar triangle. Hence the only self-polar
triangles are the four already mentioned.

Other geometries, such as PG(2, 5), admit self-polar triangles formed by three 
secants or by one secant and two non-secants. 

8. The sides of this hexagon are 


x, =0, Uy = X + Xs, ty =0, 
Uy + Xz = Qty, Hy + Hy = Hy, Hy + My = Hp. 
Opposite sides meet in the three points 
(0, 1, 2), (6, 1, 5), (2,0, 1), 


which all lie on the line x₁ + 4x₂ = 2x₃.


§14.8 


1. Two distinct transversals from R would determine a plane containing both 
aand b. 

2. Let a, b, c be three skew generators. Let an arbitrary plane through a meet c in
R. This plane contains also the generator Ra·Rb.

3. The four lines AᵢBᵢ all intersect one another, and since they are not all coplanar
they must be concurrent.
4. Calling the centers of perspective C₁, C₂, C₃, C₄, we see that C₁C₂C₃C₄ is per-
spective with B,B,B,B, from Aj, with B,B,B,B, from As, with B,B, BB, from Ag, with 
B,B,B,B, from Aq, with A,A4,4,A4, from B,, and so on (with A’s and B’s consistently 
interchanged). 

5. Each point of PG(3, p) lies on p² + p + 1 lines, each containing p further
points. Hence there are altogether 1 + p(p² + p + 1) = p³ + p² + p + 1 points,
and, by duality, the same number of planes. Each of the p³ + p² + p + 1 planes
contains p² + p + 1 lines, but each line lies in p + 1 planes; therefore the total
number of lines is

\[ \frac{(p^3 + p^2 + p + 1)(p^2 + p + 1)}{p + 1} = (p^2 + 1)(p^2 + p + 1). \]

This expression was obtained by von Staudt in 1856. (See the footnote on p. 237. See
also P. H. Schoute, Mehrdimensionale Geometrie, vol. 1, p. 5, Leipzig, 1902.) When
p = 3, the number is 130.
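The counts in ex. 5 can be confirmed for small primes by listing the points of PG(3, p) as normalized nonzero vectors over the integers modulo p and collecting the lines spanned by pairs of points. This brute-force sketch is only illustrative; it assumes p is prime, and the function names are not from the text.

```python
from itertools import combinations, product

def normalize(v, p):
    """Scale a nonzero vector mod p so that its first nonzero coordinate is 1."""
    lead = next(x for x in v if x % p)
    inv = pow(lead, p - 2, p)  # inverse mod p by Fermat's little theorem (p prime)
    return tuple((inv * x) % p for x in v)

def points(p):
    """The points of PG(3, p) as normalized 4-tuples."""
    return {normalize(v, p) for v in product(range(p), repeat=4) if any(v)}

def lines(p):
    """The lines of PG(3, p), each represented as a frozenset of its points."""
    found = set()
    for u, v in combinations(sorted(points(p)), 2):
        found.add(frozenset(
            normalize(tuple(a * x + b * y for x, y in zip(u, v)), p)
            for a, b in product(range(p), repeat=2) if (a, b) != (0, 0)))
    return found

def line_count_formula(p):
    """(p^2 + 1)(p^2 + p + 1), the expression obtained in ex. 5."""
    return (p * p + 1) * (p * p + p + 1)
```

For p = 3 the enumeration produces 40 points and 130 lines, each line containing p + 1 = 4 points.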


§14.9 

1. Because Euclidean geometry does not admit self-perpendicular lines. 

3. This is Clifford’s first theorem in its original form, which can be derived from the 
form given in our text by inversion in a circle with center S. 


§15.1 

1. By 15.11 there is, on the ray AB, a point B′ such that CD ≡ AB′. Thus we have
AB ≡ CD and CD ≡ AB′. By 15.12, AB ≡ AB′. But AB ≡ AB, and both B and B′
are on the ray AB. Hence, by 15.11, B′ = B.

2. Any triangle has an incircle, and the lengths of the tangents to it from A, B, C
are s − a, s − b, s − c, as in §1.5, ex. 3. We have to abandon all the formulas
involving trigonometrical functions, but ex. 1 remains valid. Even an acute-angled
triangle may fail to have a circumcircle.


§15.2 


1. See Coxeter 3, p. 189. 

2. If two lines have a common perpendicular m, they are symmetrical by reflection
in m. Any point of intersection on one side of m would yield another on the opposite
side, contradicting 12.2511.


§15.3 


1. The plane through l perpendicular to the plane ABC meets the latter in a line m
which may intersect l or be parallel or ultraparallel to l. In the first case, all the planes
Al, Bl, Cl pass through the point of intersection. In the second case, they pass through
the common end of l and m. In the third case, by 15.26, l and m have a common
perpendicular EF, and the planes Al, Bl, Cl are all perpendicular to the plane through
EF perpendicular to l.

2. By 15.23, p and r are parallel. Therefore the product of reflections in them is a 
parallel displacement. The first reflection leaves J invariant; the second transforms J 
into L. 


§15.4 


1. A tetragonal rotation about the front vertex on the left, a trigonal rotation about
the center of the face d, the half-turn about the line joining the midpoints of two
opposite edges, and the half-turn about the line joining two opposite vertices:
(ab)(cd) = (acbd)². Not counting the identity, we have 6 + 8 + 6 + 3 = 23.
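The tally can be checked by sorting the 24 permutations of four symbols by cycle type. This is a sketch only: it assumes, as the cycle notation in the answer suggests, that each rotation is recorded as a permutation of four objects a, b, c, d, and the function names are hypothetical.

```python
from itertools import permutations
from collections import Counter

LABELS = "abcd"

def cycle_type(perm):
    """Cycle lengths (largest first) of a permutation given as a tuple of images."""
    seen, lengths = set(), []
    for start in range(len(perm)):
        if start in seen:
            continue
        length, j = 0, start
        while j not in seen:
            seen.add(j)
            j = perm[j]
            length += 1
        lengths.append(length)
    return tuple(sorted(lengths, reverse=True))

def from_cycles(*cycles):
    """Build a permutation of a, b, c, d from disjoint cycles such as "acbd"."""
    image = {x: x for x in LABELS}
    for cyc in cycles:
        for i, x in enumerate(cyc):
            image[x] = cyc[(i + 1) % len(cyc)]
    return tuple(LABELS.index(image[x]) for x in LABELS)

def compose(f, g):
    """Apply g first, then f."""
    return tuple(f[g[i]] for i in range(len(g)))

counts = Counter(cycle_type(p) for p in permutations(range(4)))
quarter_turn = from_cycles("acbd")
print(counts)
print(compose(quarter_turn, quarter_turn) == from_cycles("ab", "cd"))
```

The four nontrivial cycle types occur 6, 8, 6 and 3 times, matching the four kinds of rotation listed in the answer, and the square of (acbd) is indeed (ab)(cd).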

2. The cyclic group $C_n$ has two “sets of poles,” each consisting of just one pole.
By adding an ineffective $p_3$-gonal pole with $p_3 = 1$, we are able to include this group
as a “trivial” solution of 15.42.


§15.5 
I. (a) Co. : 
(b) Don Dyn (n even), D, x {I} (n odd). 
2. CgCo. 


§15.6 
(a) Dy x {I}. (b) Ds x {Tf}. (c) Ay x {I}. 


§15.7 


1. The vertices of {p, 2} are p points evenly spaced along a great circle (say, the
equator), and its edges are the p arcs that join neighboring vertices. The vertices 
of {2, p} are two antipodal points (say, the north and south poles), and its edges are 
evenly spaced great semicircles (meridians). 

2. The tetrahedron has six planes of symmetry, each joining an edge to the mid- 
point of the opposite edge. The cube and octahedron have nine, one parallel to each 
pair of opposite faces of the cube and one joining each pair of opposite edges. The 
dodecahedron and icosahedron have fifteen, one joining each pair of opposite edges. 

3. $4\pi \Big/ \left(\dfrac{\pi}{p} + \dfrac{\pi}{q} + \dfrac{\pi}{2} - \pi\right)$. Two corresponding edges of the blown-up {p, q} and
{q, p} perpendicularly bisect each other at the common right angle of four specimens
of the fundamental region. Accordingly, it is natural that the order of the group
should turn out to be 4E.
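As a check, the area quotient can be evaluated exactly for the five Platonic solids and compared with four times the number of edges. This is a minimal sketch: the edge counts in the table are the familiar values, supplied here rather than taken from this answer, and the function name is hypothetical.

```python
from fractions import Fraction

# Familiar edge counts E of the Platonic solids {p, q} (supplied for the check).
EDGES = {(3, 3): 6, (4, 3): 12, (3, 4): 12, (5, 3): 30, (3, 5): 30}

def group_order(p, q):
    """Area 4*pi of the unit sphere divided by the area of the fundamental
    region, a triangle with angles pi/p, pi/q, pi/2 (the factor pi cancels)."""
    return 4 / (Fraction(1, p) + Fraction(1, q) + Fraction(1, 2) - 1)

for (p, q), e in EDGES.items():
    print((p, q), group_order(p, q), 4 * e)
```

The same function gives 4p for the dihedron {p, 2}, which has p edges.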


§15.8 
I. If the mth radius is k,, we have ky_yknj1 = ky. 
3. Using the abbreviation 
JP seo 3 Sint 6s? — = ie 
Pi P2 P2 Pi 


(cf. 10.42), we find that the radius and distance are 


10am d 1 7 
7 ok an —cos—.- 
ko Py k Pe 


§16.1 
If AB and AM, respectively, are perpendicular and parallel to r, as in Figure 16.3a, 
we have an acute angle at A and a right angle at B, and yet the rays do not meet. 


§16.2 


1. In the projective model, points and lines are represented by points and lines.
Therefore isometries are represented by collineations. Since parallel lines are
transformed into parallel lines, these collineations must transform ω into itself. Since a
reflection leaves invariant every point on the mirror, the corresponding collineation
must be an elation or a homology (§14.6). Since it is of period 2, it can only be a
harmonic homology. Since it preserves ω, its center is the pole of the mirror (i.e., the
point of intersection of the tangents at the ends of the chord).

When the mirror is represented by one of the vertical circles, the reflection appears 
as the inversion in the sphere, through this circle, orthogonal to Klein’s sphere. 

2. A pencil of concentric circles may be described as the orthogonal trajectories of
a pencil of lines. In the conformal model these lines appear as circles through a pair
of inverse points (with respect to Ω). Therefore the circles belong to the orthogonal
pencil of coaxal circles (including Ω and having the pair of inverse points for limiting
points).


§16.3 


1. In the projective model, the common perpendicular to two ultraparallel lines
joins their poles with respect to ω, and the common parallel to two rays joins their
ends.

2. For any point G on A/B, we have ∠MAG > ∠MBA. But ∠DAM = ∠EBM.
Therefore ∠DAG > ∠EBA = ∠BAD.

3. By considering congruent right-angled triangles, we see that AD = CF = BE.
Since the angle C of the triangle ABC is equal to ∠CAD + ∠EBC, the sum of all
three angles of the triangle is

∠BAD + ∠EBA,


that is, the sum of two (equal) acute angles.

4. This is a generalization of the theorem that the altitude lines of a triangle are 
concurrent (which is a corollary of Fagnano’s problem). The simplest proof uses the 
projective model and refers to Chasles’s theorem (14.64): Any two polar triangles are 
perspective triangles. 

5. Either a translation or a glide reflection, according as the triangles are, or are 
not, on the same side of their common side. 

6. To a circle of the indicated radius, draw tangents at the ends of three radii 
making angles 120° with one another. These tangents form a trebly asymptotic 
triangle. 

7. Draw the Cevian through the given point and compare Figure 16.3a. 


§16.5 


3. Consider how successive translations along CA, AB, BC will affect the side CA
of the triangle ABC. The first translation slides this segment CA along itself to a
position AX. The second (along AB) takes this to BY, where ∠ABY = A. The third
(along BC) takes this to CZ, where ∠ZCB = π − ∠CBY = π − (A + B), so that
∠ZCA = π − A − B − C. (This result can evidently be extended from triangles to
higher polygons.)

4. Consider how successive half-turns about the midpoints of DA, AB, BC, CD will
affect the side DA of the quadrangle ABCD. The first half-turn reverses this side,
yielding AD. The second (about the midpoint of AB) takes this to BX, where
∠ABX = A. The third (about the midpoint of BC) takes this to CY, where ∠BCY =
A + B. The fourth (about the midpoint of CD) takes this to DZ, where ∠CDZ =
A + B + C, so that ∠ZDA = 2π − A − B − C − D.

5. At any vertex we find one specimen of each angle of the polygon, in natural
order. The cycle may be repeated any number of times (if the polygon has a sufficiently
large area).


§16.6 

1. Compare §16.3, ex. 3. The perpendicular bisectors of the sides of the triangle
may be either intersecting or parallel or ultraparallel. 

2. The external bisectors of two angles of the triangle may be either intersecting or 
parallel or ultraparallel. 

3. The horocycle is symmetrical by reflection in any diameter. The diameter r 
reflects J into L. 

4. Two. Their centers are the two ends of the perpendicular bisector of the segment 
joining the two points. 

5. Remember that an equidistant curve has two branches. 

7. Use §16.3, ex. 2. 


§16.7 


1. An equidistant curve.

2. The additive property, described in §6.6, ex. 5, shows that hyperbolic distance
is proportional to inversive distance. The factor of proportionality is a matter of
convention, like the value ~ = 1 that led to 16.53.


Figure 16.7a


3. The following proof of Lobachevsky’s famous formula is credited to Paul Szász
[see Coxeter, Annali di Matematica pura ed applicata, (4), 71 (1966), p. 82]. In Figure
16.7a, the segment AB of length x is represented by the part AB of the line
perpendicular to Ω at Q. The circle through B with center Q represents the line through B
perpendicular to AB, and the tangent circle through A with center O (also on Ω)
represents a parallel line having the same end M. The angle of parallelism

$\Pi(x) = \angle BAM$

appears as the angle at A in the “curvilinear triangle” BAM, and again as ∠QOA.
Since x is the inversive distance between concentric circles with radii QB and QA (the
latter not drawn), we have

$e^{x} = QA/QB = \cot \tfrac{1}{2}\Pi(x),$

whence

$\Pi(x) = 2 \arctan e^{-x}.$

Any reader who dislikes using Euclidean trigonometry to obtain a hyperbolic result
may turn to page 377 for a purely hyperbolic proof.
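The Euclidean construction can be tested numerically. The sketch below places the figure in coordinates that are assumptions, not taken from the text: Q at the origin, Ω along the horizontal axis, QB = 1, QA = e^x, and the end M at (1, 0). It then measures the angle QOA directly.

```python
import math

def angle_from_figure(x):
    """Angle QOA in the assumed coordinates, for x >= 0."""
    a = math.exp(x)            # QA, so that log(QA/QB) = x with QB = 1
    c = (1.0 - a * a) / 2.0    # abscissa of O: the circle with center O on the
                               # axis passes through A = (0, a) and M = (1, 0)
    radius = 1.0 - c           # OM
    assert math.isclose(math.hypot(c, a), radius)  # OA = OM
    # Angle at O between the ray OQ (along the axis) and the ray OA.
    return math.atan2(a, -c)

def lobachevsky(x):
    """The angle of parallelism 2 arctan e^(-x)."""
    return 2.0 * math.atan(math.exp(-x))

for x in (0.0, 0.5, 1.0, 3.0):
    print(x, angle_from_figure(x), lobachevsky(x))
```

Since O lies on the line QM, the two circles touch at M, as the construction requires; the two computed angles agree to rounding error.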


§16.8
By all the circles through one point (the point of contact). 


§17.1 


1. The vectors a and c must be either parallel to each other or both perpendicular
to b.

2. (a × b) × (c × d) = [a c d]b − [b c d]a = [a b d]c − [a b c]d.

3. (a × b) · (a × b) = (a · a)(b · b) − (b · a)(a · b)
= |a|²|b|² − |a|²|b|² cos² θ
= |a|²|b|² sin² θ
= |a × b|².
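Both identities can be spot-checked with integer vectors, for which the arithmetic is exact. This is only an illustrative sketch; the helper names are hypothetical.

```python
import random

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def triple(u, v, w):
    """Scalar triple product [u v w] = u . (v x w)."""
    return dot(u, cross(v, w))

def combo(k1, u, k2, v):
    """Return k1*u - k2*v componentwise."""
    return tuple(k1 * x - k2 * y for x, y in zip(u, v))

def check(a, b, c, d):
    """True when both expansions of (a x b) x (c x d) and Lagrange's identity hold."""
    lhs = cross(cross(a, b), cross(c, d))
    first = combo(triple(a, c, d), b, triple(b, c, d), a)
    second = combo(triple(a, b, d), c, triple(a, b, c), d)
    lagrange = dot(cross(a, b), cross(a, b)) == dot(a, a) * dot(b, b) - dot(a, b) ** 2
    return lhs == first == second and lagrange

rng = random.Random(0)
vectors = [tuple(rng.randint(-9, 9) for _ in range(3)) for _ in range(4)]
print(check(*vectors))
```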


§17.2
The velocity is in the direction of the tangent. The acceleration is directed towards 
the center along the radius. 


§17.3 

I. «© =u —sinu, y = —}! —cosu. This is, of course, a congruent cycloid. 

2. x = cos u + u sin u, y = sin u − u cos u. This is a kind of spiral (but, of course,
not an equiangular spiral).

3. Since r = s cos φ.


§17.4


1. At the origin.
4. (a) s = 4 sin ψ.
(b) s = h(cse y cot y + log tan dy). 
5. Since u = log (cosh u + sinh u) = log (sec ψ + tan ψ),

$\int \sec\psi \, d\psi = \log(\sec\psi + \tan\psi) + C.$


§17.5


ρ = a sinh u. At the cusp it is zero: the curvature is infinite.


§17.6 
1. Differentiating r² = 1 twice, we obtain r · t = 0 and r · p + ρ = 0, whence
(r + ρp) · t = (r + ρp) · p = 0.
2. Because t 4 (r + pp) = 0. 


§17.7 
2. The helicoid $\dfrac{y}{x} = \tan\dfrac{z}{c}$.

3. $\dfrac{c}{a} = 1$.
§17.8 
2 «k= : 


~ 301 + eR 


§17.9


1. $x = a\mu^{u}\cos u$, $y = a\mu^{u}\sin u$, $z = c\mu^{u}$.
3. The angle is $\operatorname{arccot}\left(\sqrt{1 + c^2/a^2}\,\log\mu\right)$.
5. The cylinder based on an equiangular spiral.


§18.1 
J{rtr’r?] = Jr! + (r? x 3) = (ry X rg) + (4? x 4°) 
= (rg °P*)(rg +0?) — (rg r?)(rg° rr?) = 1. 
§18.2 
2. uv = gyWlel + goot?v® + gagiPr® 
+ Syo(ulv? + wel) + gog(u?v® + ubv®) + g—,(uPe! + uv), 
5. det e? = G1. 


§18.3 
2. Sap = Sap +1, gt? = 46% —1, where 5, = OF SoZ, 


§18.5 

3. La? =1, Ly z, = 0. 

4.81 =§s =1, S22 =('Y, Sag = 0% # A). 

5. ou = (WY), Gon = WP sin)’, g93 = 1, Sap =O (% ¥ B) 
1 22 y? 2 

& Sap = ila — w)(A — uF) bi (B — u*)(B — 1) - (C — W)C - a) 
§18.6 

I. DEA y oz, 


§19.1 
2. Uri xr; = Vlg, xr, = Ve Do gen = 0. 
The triangle formed by r' and r, has the same area, apart from sign, as the triangle 
formed by r? and ry. 
3. = 1/1 +22), 0c? = re/()?. 
4.9 =g" =1, gy = sin? a, 
grease, g,=g =O ¥)). 


§19.2 


2. tang = Vgigyo. 
4. cos 6 = cos Ocos (¢ — 6) — sin @sin (¢ — 6) 


2ql 
aa, aa 


"ise 8s 
a 1 2 a 11 12 
Eee + 20a") ~ pp a + gay) 
1 
18: 1 
— 2 


. 8182 
5. g,a) = goa".
6. The net consists of the parametric curves, which are orthogonal if $g_{12} = 0$. The


identity g4¢,, — geo. = 0 is in agreement with the fact that the internal and external 
bisectors of an angle are perpendicular. 


a faz 
7S -| [ sin 2 du! du?. 
0 0 


§19.3 
lot xr? =n/ Vg. 
2. b= —1, by = —sin?t, 5; =0 (i ¥ j). 


§19.4 
lH =0. 
2. H = {2,01 + 2,2) + lz, }/2 + 2,2). 
3 Wada, wl = bo. 


5. At an umbilic. 


§19.5 

32 4+ =k, 

4.18 = +csinh (2 — &). 
§19.6 


1. At an umbilic, 19.52 is an identity.

2. The expression is a perfect square.

3. At an umbilic, K = κ² > 0.

4. The conditions b,,: 5,9! bey = 211'&12'£22 become 

—sin® 4?:sin uw cos wu! sin w cos u?: —sin? u} 

= 2 sin? 2 + cos? u! sin? u?:sin u! cos wv! sin uv? cos u?:2 sin? u? + sin? u3 cos? 1, 

5. No. When there is a curve of umbilics, this curve is itself a line of curvature, and 
the only lines of curvature that cross it do so at right angles. 


§19.7 


I. Byg = 0. 


3. The lines of curvature are the intersections of the ellipsoid with the other quadrics 
of the system. 


a(R14)i5 M, = C55 lS ee 
2. Nii = 4 Luidis Ree —3 (8:3, Nia = B(Sia)s5 V4, = Vij .0lS iene 


3. Tieng =u, Toe, = vw, Vi, = 1, Vk, = —u'; all others are 0. 
5. By the equations just before 19.33,
Vert = Len x rp, 


Hence Vgili, = Vgirt en; = 2D (nr, r5] 
=n Dder, X ry = M+ (FX Fy); 
=n (Vg n)i = (V8); 
6. K =1. 
§20.2 
3. No; the tangents do not make a constant angle with the z-axis. 
§20.3 
2. (V8) = —V8- 
§20.4 


2. Another circle of radius b and one of radius a − b.


§20.5 


(i) 2π sin r, (ii) 2π sinh r.


§20.7 
$\cosh^2 u^1 + (u^2 + c)^2 = k^2$.


§21.1 
Yes; it forms a map of three hexagons on the torus. 


§21.2 


2. Let $P_1P_2P_3P_4P_5P_6$ be a regular hexagon concentric with and interior to the disk.
Join $P_1P_4$, $P_2P_5$, $P_3P_6$ through the boundary of the disk rather than through the center.

3. They form the Thomsen graph. 

4. Yes, if q > 0. Both are nonorientable with χ = 2 − 2p − q.


§21.3 


1. {2, 1} may be drawn as a great semicircle joining 2 antipodal vertices. {1, 2} may
be drawn as a great circle with one point on it specified as a vertex. 

2. {3, 5}/2 has 6 vertices, each joined to every other. The vertices and edges of 
{5, 3}/2 form the Petersen graph [Ball 1, p. 225]. 

5. $\{4, 4\}_{1,1}$ has 2 quadrangular faces, 4 edges, and 2 vertices; each vertex belongs
to all 4 edges.

$\{4, 4\}_{2,0}$ has 4 quadrangular faces, 8 edges, and 4 vertices; each vertex belongs to
all 4 faces.

$\{3, 6\}_{1,1}$ has 6 triangular faces, 9 edges, and 3 vertices; each vertex belongs to all
6 faces.

$\{6, 3\}_{1,1}$ has 3 hexagonal faces, 9 edges, and 6 vertices; each vertex belongs to all
3 faces.

$\{6, 3\}_{2,0}$ has 4 hexagonal faces, 12 edges, and 8 vertices.
All 5 maps are of genus 1.
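The genus can be confirmed from the Euler characteristic V − E + F = 2 − 2g, together with the incidence relations pF = 2E = qV for a map of type {p, q}. In this sketch the subscripts on the map symbols are the reading assumed here for the damaged originals, the maps are assumed orientable, and E = 9 is taken for the triangular map, that being the value the incidence relations require.

```python
# Each entry: name -> (p, q, V, E, F), with p-gonal faces and q faces at each vertex.
MAPS = {
    "{4,4}_(1,1)": (4, 4, 2, 4, 2),
    "{4,4}_(2,0)": (4, 4, 4, 8, 4),
    "{3,6}_(1,1)": (3, 6, 3, 9, 6),
    "{6,3}_(1,1)": (6, 3, 6, 9, 3),
    "{6,3}_(2,0)": (6, 3, 8, 12, 4),
}

def euler_characteristic(v, e, f):
    return v - e + f

def orientable_genus(chi):
    """Genus g of an orientable surface with characteristic chi = 2 - 2g."""
    return (2 - chi) // 2

for name, (p, q, v, e, f) in MAPS.items():
    consistent = (p * f == 2 * e == q * v)
    print(name, consistent, orientable_genus(euler_characteristic(v, e, f)))
```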

7. V = 3, 4, 4, 5, 6, 7. 
9. The positive integers p and q are not quite arbitrary. If one of them is 1, the other
can only be 2. For instance, q = 1 implies E = pr, F = 2r, whence


E+F =(p+2)r = (2p + 2q —-pgr =% <2,E=F=1,p =2. 
§21.4 


1. The cube in only one way; the dodecahedron in two ways.


§21.6 
χ = 2, 1, 0, −1, −2, −3, −4, −5, −6, −7, −8, −9.


[N] = 4, 6, 7, 7, 8, 9, 9, 10, 10, 10, 11, 11.
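The second row can be regenerated from the first. The sketch below assumes that [N] denotes the integer part of Heawood's expression N = (7 + √(49 − 24χ))/2; that formula is supplied here as an assumption and is not quoted in this answer.

```python
import math

def heawood_number(chi):
    """Integer part of (7 + sqrt(49 - 24*chi)) / 2, computed in exact arithmetic."""
    return (7 + math.isqrt(49 - 24 * chi)) // 2

chis = list(range(2, -10, -1))          # 2, 1, 0, -1, ..., -9
print([heawood_number(chi) for chi in chis])
```

Using `math.isqrt` avoids floating-point rounding; taking the integer square root first does not change the final integer part.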


§22.1 
Il. rN, = E’Ng, where, by ex. 1 at the end of §10.3, 
1 1 1 1 
E’ q ue aa 


Similarly, pN2 = EN3. 

2. This is derived from the analogous cube in the space $x_4 = 0$ by translating it
through distance 1 along the fourth dimension.

3. The cube (±1, ±1, ±1, −1) is translated through distance 2 along the fourth
dimension.

4. (1, 1, 1, 1).


§22.2 


1. Since $\cos\dfrac{\pi}{q} < \sin\dfrac{\pi}{p}$, $\dfrac{\pi}{p} + \dfrac{\pi}{q} > \dfrac{\pi}{2}$. Similarly, $\cos\dfrac{\pi}{q} < \sin\dfrac{\pi}{r}$.


2. {3, 3, 4}.


§22.3 


1. (±1, 0, 0, 0), (0, ±1, 0, 0), (0, 0, ±1, 0), (0, 0, 0, ±1).
2. (±1, ±1, 0, 0), permuted.

3. (7,1, 77, 0) = 7 %(7, 0, 7, 0) + r1(7, 7, 0, 0). 

4. The extra points correspond to the centers of the 24 icosahedra. 


§22.4 


I. No. 

2. (a) No. (b) Yes. 

3. Yes. The twelve can have their centers at the vertices of a regular icosahedron. 
4. $1^2 + 2^2 + \cdots + n^2 = n(n + 1)(2n + 1)/6$.


G)+G)ro+('2')- (3) 
Abstract group, 32, 48
Addition, 136, 138
Affine collineation, see Affinity

Affine coordinates, 202, 223, 339-340 

Affine geometry, 175, 191-230 

Affine reflection, 203, 207-208, 225 

Affinity, vii, 202-207, 225, 441 

Algebraic number, 138 

Alhambra, 58 

Alias and alibi, 111-112

Alternating group $A_n$, 271

Alternating symbol, 341 
Amplitude of a complex number, 141

Analytic geometry, 107-134, 138-141, 202-228, 
234-239 
of parallelism, 291-293, 303, 376-377, 453
Angular defect, 296-299

Angular region, 183
Apple core, 35
Axes of a conic, 117
Binomial coefficients, 168, 183, 186
Binormal, 322

BIRKHOFF, G., 31, 140, 144, 199, 236, 307, 320
Cardioid, 114, 119

CARROLL, Lewis, 4, 26, 39, 45, 50, 67, 137, 150, 
387, 389 

CARSLAW, H. S., 267-268, 287-295, 376

Cartesian coordinates, 43, 108-110, 158-159

CASEY, J., 16, 74, 75, 92


INDEX 


Categorical system, 287, 289 
Catenary, 317-321 
CAYLEY, A., 230 
Cell, 186 

see also Unit 
Center, of a conic, 117 

of curvature, 314 

of similitude, 70-71, 101
Central conics, 112-113, 117 
Central dilatation, 194 
Central inversion, 98, 277 
Central projection, 93 
Centroid, 10, 109, 214-217 
CEVA, G., 191, 220
Cevian, 204
Characteristic, see Euler-Poincaré
Characteristic tetrahedron $O_0O_1O_2O_3$, 155
CHASLES, M., 96, 249, 452
CHEPMELL, C. H., 24
Chessboard, 50, 411
CHILTON, B. L., 402-403
CHRISTOFFEL, E. B., 366-375
CHURCH, A. H., 169, 172
CIAMBERLINI, C., 421 
Circle, 14, 132-133, 262, 300 

of Apollonius, 88-89 

of curvature, 314 

of similitude, 90 
Circle-preserving transformation, 91, 146, 290 
Circular helix, 170, 323-325 
Circumcircle, 12, 262 
Circumradius, 155-157 
Circumsphere, 155, 273-274
CLEMENS, S. L., see Twain 
CLIFFORD, W. K., 72, 262 
Close packing of spheres, 405-412 
Coaxal circles, 85-89, 114 
Coaxal equidistant curves, 300 
COE, C. J., 310
COHN-VOSSEN, S., 53, 54, 83, 101, 148, 152, 234,

355, 363, 365, 369, 382, 384, 402, 406-408 
Colatitude, 343 
COLEMAN, A. J., 354 
Collinear points, 65, 178 
Collineation, 247 

see also Affine, Equiaffine, Perspective, Pro- 

jective 
Combinatorial topology, 379-395 
Common parallel, 292 
Common perpendicular, 293 
Commutative law, 237, 269 
Compasses, 79 
Complete quadrangle, 19, 230 
Complete quadrilateral, 231 
Complete V-point, 389 
Complex numbers, 135-147

Compound interest, 127 

Compound polyhedron, 158 

Concentric horocycles, 300, 378 

Concho-spiral, 172, 326-327 

Cone, 129-130 

Configuration, 233, 237-239 

Confocal quadrics, 340, 362-363 

Conformal geometry, 84, 92 

Conformal models of hyperbolic geometry, 289, 
302-304 

Conformal transformation, 145-147, 203 

Congruence, 5, 198, 263-268 

Congruent transformation, see Isometry

Conic, 115-118, 252-255, 260

Conjugate axis of a hyperbola, 117 

Conjugate complex number, 142 

Conjugate hyperbolas, 364 

Conjugate lines, points, 248 

Consistency, 229, 287-290 

Continued fraction, 146 

Continuity, 5, 103, 187 

Contravariant components, 329 

Convexity, 53, 148, 183-186 

Coordinates, see Affine, Areal, Barycentric, 
Cartesian, Cylindrical, General, Homoge- 
neous, Polar, Tangential 

Coordinate transformation, 111-119, 218-219 

Copenhagen’s Stock Exchange Building, 327 

Correlation, projective, 248 

Correspondence, see Transformation 

Corresponding points on parallel rays, 266 

cosh, 118, 124 

COTES, R., 144

COURANT, R., 23, 79, 123

COURT, N. A., 9, 10, 12, 17, 23, 69, 70, 71, 89

Covariant components, 329 

Cox, Homersham, 258-262 

CRAMER, G., 330 

Critical lattice of a sphere, 335-337 

Cross-cap, 383-385 

CROWE, D. W., 294

Crystal classes, 278-279 

Crystallographic restriction, 60-61, 444 

Crystallography, 50-61, 225-228, 278-280, 332- 
334, 413 

Cube {4, 3}, 4, 149-151

Cubic close packing, 407-411 

Cubic graph, see Trivalent 

Cubic honeycomb {4, 3, 4}, 399, 407, 414 

CUNDY, H. M., 28, 165

Curvature, 15, 314, 322 
see also Center, Circle, Normal curvature, 
Principal curvatures 




Cyclic group $C_n$, 32, 270

Cycloid, 49, 317, 319 

Cyclotomy, 26-28, 141-142, 443 
Cylinder, 129-130 

Cylindrical coordinates, 323, 337-338 
DanssE, A. E., 168

DARBOUX, G., 321, 324-326, 361

DARWIN, C., 407

DAVENPORT, H., 335
DEDEKIND, R., 187, 200, 201

Definite quadratic form, 119 

Degenerate conic, 255 

Degree of a graph, see Valency 
DE LA VALLÉE POUSSIN, see La Vallée Poussin
Density of a polygon, 36, 38

Derivative of a vector, 312 

DESARGUES, G., 93, 192, 229, 238-240 

DESCARTES, R., vii, 108, 125, 154, 422 

DE SITTER, W., 299 

Developable surface, 354 

Diagonal points, 19 

Diameter of a horocycle, 300 

Differential geometry, 307-378 

Digon {2}, 37 

Dihedral angle, 35, 150, 157, 400, 413 

Dihedral group D,, 32, 271 

Dihedron {p, 2}, 282, 386 

Dilatation, 68, 79, 101, 110-111, 136, 193-203,
399

Dilative reflection, 72-76, 110-111, 147

Dilative rotation, 72-76, 101-104, 111, 125, 133,
139, 145, 164-165, 317, 326-327

Dipyramid, 150, 280 

Direction cosines, 128 

Direct isometry, 39, 96-101, 269 

Directly congruent figures, 39 

Directly similar figures, 73 

Direct product of groups, 49 

Directrix, 116 

Direct similarity, 72-74 

DIRICHLET, G. L., 54, 334

Dirichlet region, 53-57, 169-170

Discrete group, 283 

Disk, 385 


Displacement, 107 
see also Direct isometry 
Dissection, 152 
Distance, 220 
Distributive law, 309-310 
DJORDJEVIĆ, R. Z., 419
DNA, 278 
Dodecagram {12/5}, 37 
Dodecahedron, see Regular dodecahedron, 
Rhombic dodecahedron 
DODGSON, C. L., 6, 295, 387
see also Carroll 
Dominoes, 60 
DONCHIAN, P. S., 402, 404 
DONKIN, W. F., 269, 299
DÖRRIE, H., 211-212
Doubly asymptotic triangle, 294-299 
D-stix Pre-engineering Kit, 412 
Dual bases for a vector space, 328 
topological, 386
see also Reciprocal polyhedra 
Dual maps, 379, 386 
DUDENEY, H. E., 27
DUPIN, C., 361-365
Dupin’s indicatrix, 363-365 
Eccentricity, 116
Eight-color problem, 395
EINSTEIN, A., 176, 299 
Elation, 247, 251 
Electromagnetic theory of light, 256 
Element of arc, 121 
Elements, the four, 4 
Elements of Euclid, 3, 175 
Ellipse, 94, 116, 118 
Elliptic geometry, 94-95, 375, 386 
Elliptic paraboloid, 130 
Elliptic polarity, 252 
Elliptic projectivity, 244 
Enantiomorphism, 276-279 
End of a line, 266 
Enneagon {9}, 25, 28
ENOCH, 41
Envelope, 252, 315 
Ephesians, 397 
Epicycloid, 114 
Equation, of a circle, 43, 110, 113 
of a conic, 116-119 
of a line, 108
Equiaffine collineation, see Equiaffinity 
Eauiaffinitv vii. 206-208 975 449 


Equiangular spiral, 125-127, 142, 165, 316 
FRICKE, R., 50

Frieze patterns, 47-49 

Froth, 409-412 

FRUCHT, R., 388 

Fundamental group of a surface, 382 

Fundamental magnitudes, of the first order, 344 
Hyperbolic projectivity, 244
Hypercube {4, 3, 3}, 398, 414
Hyperplane, 186
Hyperspace, 185-186 
Hypocycloid, 114 


I, see Central inversion
Icosahedral group $A_5$, 273
Icosahedron {3, 5}, 4, 149-151, 162-163, 432
Ideal elements, 83, 93 
Identity, the, 29 
Image, reflected, 7 
Incenter, 12, 265 
Incidence, 230 
Incircle, 12 
Indefinite quadratic form, 119 
Index of a subgroup, 199 
Indicatrix of Dupin, 363-365 
INFELD, L., 144, 213 
Infinity, 83, 93, 116 
Inflectional tangents of a surface, 354 
Initial line, 110 
Inner product of vectors, 308 
Inradius, 12, 155-157 
Insphere, 155 
Integer, 136 
Integral curvature of a geodesic triangle, 372 
Intermediacy, 5, 176, 266 
Intersecting pencil of coaxal circles, 85 
Interval, 178 
Intrinsic geometry on a surface, 375 
Invariant, 29, 69 
Inverse, 29, 78 
Inversely similar, see Oppositely similar 
in a circle, 77-91, 114, 142, 145, 283-286
space, 92
Involute, 316 
Involution, 245-247 
Irrational number, 4, 137, 200 